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EDITORIAL 


A funding mosaic for loss and damage 


n the context of climate policy, “loss and damage” re- 
fers to unavoided climate change impacts, including 
those from extreme weather events and slow-onset 
events such as sea-level rise, increasing temperatures, 
and loss of biodiversity. Some argue that focusing on 
loss and damage distracts industrialized countries 
from attempts to avoid losses and damages in the 
first place through decarbonization and adaptation. Oth- 
ers argue that insufficient mitigation by industrialized 
countries has unleashed an unfair burden on develop- 
ing countries, so developed countries alone must address 
loss and damage financially. The impasse was broken at 
the 27th Conference of the Parties (COP27)—the interna- 
tional climate negotiations—in November 2022, through 
a decision on new funding arrangements for loss and 
damage, including a new fund. Re- 
flecting on the funding needs, Mal- 
dives’ Environment Minister Ami- 
nath Shauna argued at COP27 that 
the world requires a “mosaic of so- 
lutions.” As a transitional committee 
begins to assemble recommenda- 
tions to operationalize this mosaic 
of funding arrangements, several 
critical questions require insights 
from the scientific community. 

To address major needs, what 
solutions can resolve gaps? Climate 
change is anticipated to have en- 
during effects on the real economy, 
manifesting as depressed production, decreased em- 
ployment, diminishing social coherence, and cultural 
loss. All of these stress societies and public and private 
budgets. Climate-related emergencies could involve dis- 
placement or relocation (planned or unplanned) and 
may necessitate climate-resilient reconstruction and 
recovery. Slow-onset events may involve redistribution 
of populations through migration. This will require in- 
sights from social, economic, and climate data. Address- 
ing these needs should improve peoples’ situation, not 
keep them at the brink of survival. 

What steps will improve coherence and coordination 
across the landscape of global, regional, and national 
institutions that fund activities to address loss and 
damage? Support in response to climate-related emer- 
gencies is currently channeled through humanitarian 
funding; emergency payouts inject cash and support 
for temporary shelter, food, and basic life support. 
Debt instruments such as loans, bonds, and budget 
reallocations currently finance reconstruction and 
recovery. But future climate impacts are needed that 


“= Improve 
peoples’ situation, 


not keep 
them at the brink 
of survival.” 


foster demand for public and private investments in 
ways that buoy people’s ability to navigate disruptions 
across their life cycle, such as investments in health, 
vocational education, social safety nets, and restora- 
tion of ecosystems. 

Regional and global monetary coordination will 
increasingly be invoked to provide fiscal space for 
governments when disruptions affect their finances. 
For example, regional financing arrangements allow 
groups of countries to pledge mutual financial support 
to members experiencing financial difficulties in their 
regions. Special drawing rights from the International 
Monetary Fund can help stabilize exchange rate vola- 
tility, which can be a measure of a country’s economic 
health. Considerable research is needed on economic 
tools that provide stability in spite 
of disruptions. 

How can current financial mo- 
dalities better address needs and 
promote stability? New and ad- 
ditional capital from a new fund 
alone will not solve the loss and 
damage finance issue. The deci- 
sion recognizes that a variety of 
mechanisms—including through 
international financial  institu- 
tions, multilateral development 
banks, central banks, and social 
safety net programs—are required 
to shape and channel support to 
address needs. Research is necessary for the design of 
systems and structures to provide stability and enhance 
well-being in the face of progressive upheaval and un- 
certainty. Operationalization of funding arrangements 
raises questions about how to adjust timing, eligibility 
requirements, and access to promote stability. Informa- 
tion is needed about funding gaps, and creative and ef- 
fective solutions for filling those gaps to help vulnerable 
populations and ecosystems on which they depend. 

The scientific community can bolster operation- 
alization of loss and damage funding arrangements 
through insights about these three questions. Analysis 
about the real economy will deepen understanding of 
the full dimensions of escalating, nonlinear impacts. 
Developing this mosaic of solutions means harness- 
ing the strength of the global economy to deliver sta- 
bility in robust, flexible ways that can anticipate and 
address societal needs in the face of advancing ad- 
verse climate impacts in every region of the world. 


—-Koko Warner and Michael Weisberg 
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CLIMATE SCIENCE 


In 2022, Earth set new records for warming 


emperatures continued to rise at an alarming 
pace in 2022, which became the fifth- or sixth- 
hottest year in modern history, U.S. and European 
science agencies reported last week. Earth’s av- 
erage recorded surface temperatures were some 
1.2°C warmer than preindustrial times. Nearly 
30 countries set individual all-time heat records, and 
some 850 million people experienced the warmest 
temperatures of their lives last year. As in 2021, the 


warming was suppressed by a persistent, multiyear 
La Nifia cooling pattern in the tropical Pacific Ocean, 
the agencies said. But La Nina is expected to wane this 
year, setting the stage for even higher temperatures. 
Meanwhile, the world’s oceans, which capture 90% 
of the excess heat from global warming and are less 
prone to short-term temperature fluctuations, again 
had their hottest year on record in 2022—as they have 
nearly every year since the 1990s. 


China reports COVID-19 deaths 


PUBLIC HEALTH | China’s government said 
last week that nearly 60,000 people have 
died after contracting COVID-19 since it 
abandoned its zero-COVID policy on 

7 December 2022—a major departure from 
previous assertions, deemed not credible 
by outsiders, that fewer than 10 people 

per day died since the policy ended. The 
new tally includes hospitalized patients 
for whom COVID-19 was either the direct 
cause of death or a contributing factor, a 
National Health Commission official said 
at a 14 January press briefing. The aver- 
age age of the deceased was 80, and more 
than 90% had underlying conditions. The 
official added that 300,000 COVID-19 
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patients have been hospitalized and that 
infections are now tapering off. 


Hungary protests funding freeze 


POLitics | Hungary has vowed to fight an 
EU decision to suspend millions of euros 
for research and higher education as part 
of an ongoing dispute about control of 

the country’s universities. In December 
2022, the European Union notified the 
Hungarian government that it would 
exclude 21 of the country’s three dozen uni- 
versities from the Horizon Europe research 
funding system and from the Erasmus pro- 
gram, which funds international student 
exchanges. The European Union says the 
universities’ oversight system, introduced 


by the current government, fails to meet 
EU standards because it allows politicians 
to sit on the schools’ governing bodies. 
Since 2021, Hungary has received roughly 
€60 million from Horizon Europe to fund 
nearly 200 projects, and in 2020, more 
than 20,000 Hungarian nationals received 
roughly €40 million in Erasmus grants. A 
government spokesperson said last week 
that Hungary would sue to reinstate the 
funding if ongoing negotiations fail. 


Japan aims to boost Ph.D. jobs 


woRKFORCE | Alarmed by the country’s 
sagging industrial prowess and a dearth of 
jobs for Ph.D. holders, the Japanese govern- 
ment will offer tax breaks for corporations 
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Unusually warm weather this 

winter has forced ski resorts 
across Europe, like this 

one in Austria, to make snow: _ 


hiring recent doctoral graduates. Starting in 
April, companies will be able to claim a cor- 
porate tax credit worth 20% of salaries and 
other costs associated with hiring research- 
ers who've earned a Ph.D. within the past 

5 years. To qualify, companies also have 

to boost the share of R&D salaries going 
to staff with a doctoral degree by at least 
3% annually. Ph.D. holders face bleak job 
prospects in Japan. Companies prefer hiring 
people with master’s degrees and training 
them in-house, and the number of research 
positions at Japan’s publicly supported 
universities is effectively capped. 


Reef sharks face extinction 


CONSERVATION | Sharks living near 
coral reefs are nearly twice as likely to 
be threatened with extinction as sharks 
in general, according to a new analy- 
sis. Many shark species are overfished 
for their fins and meat, which led the 
Convention on International Trade in 
Endangered Species of Wild Fauna and 
Flora in November 2022 to regulate the 
exports of about 10% of all 1199 species 
of sharks and rays. Fifty-nine percent of 
the 134 species of reef-dwelling sharks 
and rays are in danger of extinction, 
primarily from overfishing but also from 
climate change and habitat degradation, 
researchers report this week in Nature 
Communications. They call for better 
enforcement of fisheries regulations and 
an increase in marine protected areas. 
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Court nixes defamation verdict 


#METOO | An appeals court in Peru has 
nullified a lower court’s verdict against 
archaeologist Marcela Poirier, who was con- 
victed of defamation after publicly accusing 
prominent archaeologist Luis Jaime Castillo 
Butters of sexual harassment. Castillo 
Butters had sued Poirier after she reported 
on Facebook that Castillo Butters’s employer, 
the Pontifical Catholic University of Peru, 
had found evidence of harassment. In May 
2021, Poirier was given a $48,400 fine and 

a 20-month suspended jail sentence, which 
she appealed. The appeals court ordered a 
retrial after finding that the proceedings 
were “plagued with irregularities,” according 
to Poirier’s lawyer. Castillo Butters did not 
respond to requests for comment. 


Laser guides lightning 


PHysics | Like the Greek god Zeus aim- 
ing thunderbolts, physicists working on a 
mountaintop in Switzerland have used a 
high-power laser to steer lightning toward 
a 124-meter-tall radio tower. The advance 
marks the culmination of decades of efforts 
to show that a laser can drill an ionized 
channel through the atmosphere that pro- 
vides a path of least resistance for lightning 
to follow. The research team steered the 
bolt over the final 50 meters before impact. 
The finding, reported this week in Nature 
Photonics, could open the way to using 
lasers to create “virtual lightning rods” that 
protect rocket launchpads and other sensi- 
tive infrastructure by preemptively draining 
away the threat. The technique could shield 


A mountaintop laser was fired 1000 times per second 
to create a channel that guided a lightning bolt. 


39 
Jacob Carter of the Union of Concerned 
Scientists, about a scientific integrity policy 
unveiled last week by the Biden 
administration to safeguard U.S. government 
scientists from political meddling. 


an area larger than that covered by conven- 
tional lightning rods, the researchers say. For 
most applications, however, the multimil- 
lion-dollar laser system likely won’t replace 
those humble and far less costly devices. 


JWST glimpses its first exoplanet 


ASTRONOMY | NASA’s JWST orbiting 
observatory has made its first discovery 
of a previously unknown planet around 
another star and is searching for its 
atmosphere. The newly discovered world, 
dubbed LHS 475 b, is almost exactly 
Earth-size. Other scopes can parse the 
atmospheres of gas giants from starlight 
passing through, but the atmospheres of 
small rocky exoplanets present a much 
harder challenge; researchers hope JWST 
will help crack it. The discoverers of LHS 
475 b told the American Astronomical 
Society last week they will need more 
observations to pinpoint whether it has an 
atmosphere, and if so, its composition. 


China’s population starts to fall 


DEMOGRAPHY | It’s official: China’s popula- 
tion has started to shrink, a turning point 
that could herald major economic chal- 
lenges. In 2022, the number of Chinese 
residents—excluding those of Hong Kong, 
Macau, and Taiwan—fell by 850,000 people 
to 1.4 billion, China’s National Bureau 

of Statistics announced this week. The 
drop, the country’s first since the 1960s, 
was computed before the current wave of 
COVID-19 deaths began and may have been 
accelerated by couples deciding against 
having children during the pandemic. 
Demographers had long predicted the 
decline and expect it to be a long-term 
trend. China’s birth rate has fallen for years, 
even after the government ended its one- 
child policy in 2016, and is now among the 
world’s lowest. One reason is the large-scale 
migration to cities, where raising children 
is expensive, demographers say. 
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GLOBAL HEALTH 


Taliban decree deepens Afghanistan's health crisis 


A hospital ward for severe 
malnutrition in Kabul. Six 
million people in Afghanistan 
are on the brink of famine. 


Ban on female NGO staff hampers critical aid work and jeopardizes polio eradication campaign 


By Leslie Roberts 


decree issued by the Taliban last 

month that bars women from work- 

ing in national and international non- 

governmental organizations (NGOs) is 

another devastating blow to women’s 

rights in Afghanistan. It also threat- 
ens to plunge a country beset by hunger and 
natural disasters even deeper into a public 
health crisis. 

The United Nations and its humanitar- 
ian partners are engaged in intense nego- 
tiations to persuade the Taliban to reverse 
the edict. But for now, many NGOs, which 
depend heavily on female staff, have made 
the agonizing decision to suspend their op- 
erations, which provide vital food, hygiene, 
and medicines. The decree also jeopardizes 
the global campaign to eradicate polio, in 
which women play crucial roles. Afghani- 
stan and its neighbor Pakistan are the last 
two countries where the wild poliovirus is 
endemic, and the campaign is going full 
bore to wipe it out by the end of this year. 

Since August 2021, when the extremists 
took over, Afghanistan’s economy has col- 
lapsed and the country has been hit with 
a punishing drought, an earthquake, floods, 
and a brutal winter. More than 28 million 
people are in need of humanitarian aid, 
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such as food, shelter, clean water, hygiene, 
warm clothes, child protection, education, 
cash transfers, immunizations, and essen- 
tial health services. Some 6 million are on 
the brink of famine. 

Afghanistan’s Ministry of Economy pro- 
nounced the edict in a 24 December let- 
ter to the Agency Coordinating Body for 
Afghan Relief and Development (ACBAR), 
which has 183 national and international 
NGOs as members. The letter claimed some 
female NGO staff were not wearing the hi- 
jab properly, says ACBAR Director Fiona 
Gall, and said it would revoke the operat- 
ing licenses of organizations that did not 
comply with the ban on female staff. (The 
letter came after the Taliban barred women 
from universities and girls from secondary 
schools.) The Taliban now visits NGO offices 
regularly to see whether any women are at 
work; if so, they may get arrested. Women- 
led NGOs are under particular scrutiny. 

For cultural reasons, Afghan women 
can’t interact with male aid workers. By is- 
suing the decree, “The Taliban has in effect 
suspended aid for half the population of 
Afghanistan,’ Sima Bahous of UN Women 
said in a 25 December statement, adding 
that “11.6 million women and girls are no 
longer receiving vital assistance.” And many 
national and international NGOs say they 


had no choice but to temporarily suspend 
some or all operations. A quick survey of 
87 NGOs conducted by the Humanitarian 
Access Working Group on 12 January found 
that 83% had done so. 

“We are in a really difficult spot. We have 
no intention of abandoning the communi- 
ties we work with,” says Keyan Salarkia of 
Save the Children Afghanistan, which has 
more than 5000 staff, about half of them 
women. But, “We can’t reach women or chil- 
dren and can’t keep our staff safe. That is 
not a compromise we could make.” 

Kochay Hassan, executive director of the 
Afghan Women’s Educational Center, says 
her organization continues to operate, but 
with most women working at home. The 
decree “will not stop us ... if women can 
still work from home and engage with our 
male colleagues,” Hassan says. The Vision 
Development Organization, which works 
on issues such as women’s empowerment, 
girls’ education, and health promotion, is 
also persevering, says founder and Execu- 
tive Director Madina Mahboobi. 

The United Nations’s own agencies are 
technically exempt from the ban but could 
still see their work crippled by it. The UN 
World Food Programme (WFP), for exam- 
ple, which is aiming to feed 15 million peo- 
ple through the winter, works with about 
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100 local partners. “The whole humanitar- 
ian community is affected by the decision 
in one way or another,” a WFP spokes- 
person says. 

Afghanistan’s Ministry of Public Health 
has said female NGO workers in the health 
sector are exempt from the decree as well, 
but exactly what that statement means is 
unclear, Gall says, and many organizations 
worry about the safety of their female staff. 
It appears that female health staff can work 
in hospitals and clinics, but the status of 
mobile teams is less clear, Gall says. 

As a workaround, Hassan and others 
have adopted new terminology for their 
NGOs’ work. In a number of provinces, “safe 
spaces” for women and children, which 
provide health interventions among other 
services, are now “health centers” that con- 
tinue to operate with female staff, who care- 
fully follow the Taliban’s dictates on dress 
code and male chaperones. 

Leaders of the Global Polio Eradication 
Initiative (GPEI) are trying to clarify what 
the edict means for their work. With just 
two cases caused by the wild poliovirus in 
Afghanistan in 2022, they say they have no 
intention of letting up. Two days after the 
edict came out, GPEI went ahead with a 
campaign in four eastern provinces, using 
female vaccinators. But UNICEF decided 
not to send its female staff, who inform 
women about upcoming campaigns and 
their benefits. “That is not sustainable go- 
ing forward,’ says Hamid Jafari of the 
World Health Organization, who directs po- 
lio operations in the region. 

GPEI is planning for a large campaign 
targeting 5.3 million Afghan children in 
late January, Jafari says. “We are working 
closely with the ministry of health to en- 
sure that women can stay in campaigns.” If 
not, campaigns will proceed with male vac- 
cinators, as they already do in parts of the 
country. Fewer children will be vaccinated, 
especially babies who can’t be taken out of 
the house, Jafari says, “But we will go ahead 
and try our best.” 

The U.N. and humanitarian organiza- 
tions are still hoping the decree will be re- 
versed. The U.N. Resident Coordinator in 
Afghanistan, Ramiz Alakbarov, met with 
Qari Din Mohammad Hanif, the Taliban 
minister of economy, on 26 December, and 
discussions have continued. Observers note 
dissent among the Taliban; some ministers 
have said they don’t agree with the decree. 
And the Taliban seems to have left the door 
open, at least partially, to more discussions. 
“That means there is room for compromise,” 
Gall says. “But it might not happen as soon 
as we would like.” 


Leslie Roberts is a journalist in Washington, D.C. 
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PEER REVIEW 


To reduce ‘reputational bias, 
NIH may revamp grant scoring 


Proposed rating system seeks to minimize 
influence of applicant’s resume and workplace 


By Jocelyn Kaiser 


esearchers familiar with the National 

Institutes of Health’s (NIH’s) peer- 

review process can cite many cases 

where the reputation of a scientist or 

their workplace seemed to count for 

more than the strength of their ideas. 
There was the scientist who moved from the 
Ivy League to a large public university and 
saw scores on his grant applications drop. 
The investigators from historically Black in- 
stitutions who always seemed to do worse 
than applicants from majority white schools. 
And the big names from research power- 
houses whose proposals won stellar ratings 
without much scrutiny. 

“Anyone who actually attends 
a study section sees [reputa- 
tional bias] happening,” says 
Noni Byrnes, director of NIH’s 
Center for Scientific Review 
(CSR), referring to NIH peer- 
review panels. Now, in a bid 
to reduce that bias, her center 
is proposing the first overhaul 
of NIH’s scoring system in 
14 years. But the idea is getting 
a mixed reception. 

The proposed reform was 
prompted, in part, by a strik- 
ing pattern in NIH awards: About 70% of 
grants go to just 10% of all NIH-funded 
institutions. NIH says both anecdotes and 
data suggest reputational bias is partly re- 
sponsible. So, in 2020, a working group of 
CSR’s advisory council began to examine 
ways to reduce this bias, as well as ease the 
workload for reviewers. 

Currently, reviewers rate proposals on sig- 
nificance, investigator, innovation, approach, 
and environment. The reform proposal, re- 
leased last month, would reorganize those 
five criteria into three factors. Two would be 
scored: importance, and rigor and feasibility. 
Reviewers would also consider, but not score, a 
third factor: expertise and resources. If the re- 
viewer sees problems in these areas—because 
the applicant lacks relevant experience, for 
example, or access to key equipment—the 
reviewer could adjust the overall score and 
write a brief explanation for NIH staff. 


“There is value 
insome 
objective score 
... to help 
to assess 


the investigator.” 


Shelley Berger, 
University of Pennsylvania 


Some researchers worry these changes 
would go too far. To Shelley Berger, an epi- 
genetics researcher at the University of Penn- 
sylvania and a member of the NIH director’s 
advisory panel, de-emphasizing an appli- 
cant’s background is a mistake. “There is 
value in some objective score to try to help 
to assess the investigator,’ she said at the 
panel’s December 2022 meeting. Rather than 
remake the scores, NIH could use “other 
methods to try to help underrepresented 
investigators and institutions to get into the 
NIH funding system,” she wrote in an email. 

In contrast, others think the changes 
don’t go far enough. They say NIH should 
simply blind reviewers by preventing them 
from seeing an applicant’s iden- 
tifying information, at least 
initially. NIH is already experi- 
menting with such blinded re- 
views for a limited number of 
grants. In the first year, the ap- 
proach appeared to result in a 
more diverse pool of applicants 
for those grants and greater ra- 
cial diversity among awardees. 
Some other U.S. research agen- 
cies, such as NASA, are also 
testing blinded reviews. 

At NIH, Byrnes says imple- 
menting blinded reviews on a 
large scale would be impractical, because 
anonymizing proposals is “unwieldy” 
and reviews take longer. In addition, the 
agency has found in some tests of blind- 
ing that reviewers could still identify about 
20% of the applicants. 

NIH is now collecting comments on the 
proposal, with a 10 March deadline. CSR 
expects to release the final review criteria 
this summer. 

If NIH adopts the new system, it would 
launch in 2025. The agency should monitor 
whether it is having any impact, says drug 
abuse researcher Michael Taffe of the Uni- 
versity of California, San Diego, who has 
long criticized NIH for not doing enough to 
narrow a funding success gap between Black 
and white scientists. “NIH has a tendency to 
put these policy changes in place and not ... 
determine if they are producing the desired 
outcome,” he says. 
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Stem cell factors reverse signs 


of aging in mice 


Will reprogramming technique one day help people? 


By Catherine Offord 


decade after Kyoto University 

biologist Shinya Yamanaka won a 

share of a Nobel Prize for discovering 

a cocktail of proteins that reprogram 

adult cells into versatile stem cells, 

two teams argue the proteins can 
turn back the clock for entire organisms— 
perhaps one day humans. One group at a 
biotech used gene therapy to deliver some 
of the so-called Yamanaka factors into old 
mice, and modestly extended their life span. 
A separate team followed a similar strategy 
to reverse aging-like changes in genetically 
engineered mice. 

In both cases, the Yamanaka factors ap- 
pear to have rejuvenated part of the ani- 
mals’ epigenome, chemical modifications 
on DNA and proteins that help regulate gene 
activity. But other scientists say suggestions 
of age reversal are premature. “These studies 
use reprogramming factors to reverse epigen- 
etic changes that happen during aging,” says 
Matt Kaeberlein, a geroscientist at the Uni- 
versity of Washington, Seattle, but that’s a far 
cry from making an old animal young again. 

Several groups had already found that 
mice genetically engineered to begin ex- 
pressing Yamanaka factors in adulthood 
show reversal of certain aging symptoms. 
To explore an approach that might lead 
to a more practical treatment for people, 
San Diego-based company Rejuvenate Bio 
injected elderly (124-week-old) mice with 
adeno-associated viruses (AAVs) carrying 
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genes for three of the factors, collectively 
known as OSK. 

These animals lived another 18 weeks on 
average, compared with 9 weeks for a control 
group, the company reported in a preprint 
on bioRxiv this month. They also partially re- 
gained patterns of DNA methylation—a type 
of epigenetic mark—typical of younger ani- 
mals. Although some studies have suggested 
Yamanaka factors can promote cancer, Noah 
Davidsohn, Rejuvenate’s chief scientific offi- 
cer and co-founder, says the company has so 
far seen no obvious negative effects. 

“T would say it is provocative—possibly a 
breakthrough,” says Steven Austad of the 
University of Alabama, Birmingham, who 
studies the biology of aging. “But it will need 
to be replicated and the mechanism explored 
before we can say for sure.” 

The second study, published in Cell last 
week, is from a team led by Harvard Medi- 
cal School geneticist David Sinclair, who 
has backed several controversial “anti-ag- 
ing” interventions. (Rejuvenate’s approach 
grew from an earlier collaboration between 
Sinclair and Davidsohn, but Sinclair isn’t in- 
volved in the company’s research, Davidsohn 
says.) Sinclair’s team set out to test his “in- 
formation theory of aging,” which posits that 
our bodies get old because of the cumulative 
loss of epigenetic marks. Cells’ DNA repair 
mechanisms, operating throughout a lifetime 
to fix DNA cuts and other damage, are what 
degrade these marks, he argues. 

To test the theory in mammals, the team 
genetically engineered mice that, when given 


A16-month-old mouse with induced DNA breaks 
(right) looks much older than a normal counterpart. 


a drug, make an enzyme that cuts their DNA 
at 20 sites in the genome. The animals’ cells 
faithfully repaired the damage. But wide- 
spread changes in cells’ DNA methylation 
patterns and gene expression followed, con- 
sistent with Sinclair’s theory. The mice ended 
up with an epigenetic signature more like 
that of older animals, and their health deteri- 
orated. Within weeks, they lost hair and pig- 
ment; within months, they showed multiple 
signs of frailty and tissue aging. 

To see whether the epigenetic degradation 
was reversible, the researchers injected some 
of these seemingly aged mice with AAVs car- 
rying OSK genes, which Sinclair’s group re- 
cently reported could reverse loss of vision in 
aging rodents. Analyses of the mice’s muscles, 
kidneys, and retinas suggest the cocktail re- 
versed some of the epigenetic changes in- 
duced by the DNA breaks. The findings point 
to a way to drive an animal’s age “forwards 
and backwards at will,” Sinclair says, and sup- 
port the idea of epigenome-targeting treat- 
ments for aging in humans. 

Molecular biologist Wolf Reik, director of 
the rejuvenation-focused Altos Cambridge 
Institute of Science, praised the sophistica- 
tion of the Harvard team’s study, but says the 
team’s indirect way of inducing epigenetic 
changes with dramatic DNA breaks makes it 
hard to prove those changes are what’s caus- 
ing aging. The DNA breaks could be taking 
their toll through a different route. It’s also 
unclear how well mice with induced DNA 
breaks mimic naturally aging animals, says 
Jan Vijg, a geneticist at the Albert Einstein 
College of Medicine. 

He and others stress that aging is a com- 
plex process with multiple contributing 
factors, and that in both papers, the effects 
of OSK treatment were moderate: a small 
extension of life span in one, and a partial 
reversal of artificially induced symptoms 
in the other. “The jump that now aging is a 
program” that can be wound backward isn’t 
justified by the research, Vijg says. 

Still, both groups want to move their work 
toward the clinic. Rejuvenate is examining 
the mechanisms underlying the treatment’s 
action and tweaking its delivery and com- 
position, Davidsohn says. “OSK might not 
be the final set” of factors, he adds. Sinclair 
says his team is already testing AAV-delivered 
OSK in the eyes of monkeys. “If those studies 
in monkeys go well and everything looks safe 
enough for humans, the plan is to immedi- 
ately apply to the FDA [Food and Drug Ad- 
ministration] to do a study in one or more 
[age-related] diseases of blindness.” 


Catherine Offord is a science journalist in Barcelona. 
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WORKFORCE 


NSF under fire for not tracking sexual orientation 


In major demographic survey of scientists, agency will only query gender identity 


By Katie Langin 


he U.S. National Science Foundation 
(NSF) says it does not plan to include 
a question about sexual orientation 
in a major national workforce survey, 
prompting hundreds of researchers to 
send a letter of protest. 

Last month, the agency submitted its 
plans for the 2023 National Survey of Col- 
lege Graduates (NSCG), a biennial survey 
of more than 160,000 U.S. bachelor’s degree 
holders with a focus on the science and 
engineering workforce, to the Office of 
Management and Budget (OMB) for 
approval. Many LGBTQ scientists were 
pleased that the survey will, for the first 
time, include a question about gender 
identity for all respondents. But the ab- 
sence of a sexual orientation question is 
“incredibly disappointing,” says Ramon 
Barthelemy, an assistant professor at 
the University of Utah who has studied 
the experiences of LGBTQ scientists in 
physics. Speaking as a gay man, he says, 
“We have fought so hard for so long to 
try to get representation in the scientific 
community, and what NSF is commu- 
nicating to us is, they don’t want us to 
have that representation.” 

The agency had raised hopes in 2021 
when it pilot tested questions about 
gender identity and sexual orientation 
on the NSCG. Advocates took that as 
a sign that in coming years the agency 
would fully implement those ques- 
tions in its suite of workforce surveys, 
including the Survey of Earned Doctor- 
ates and the Survey of Doctorate Recipients, 
which focus on Ph.D.-educated scientists and 
engineers. Those surveys provide crucial data 
about which demographic groups are under- 
represented in the U.S. STEM community. 

Up to now, respondents have had the op- 
tion of identifying their sex only as “male” 
or “female.” If NSF’s plans are approved, this 
year’s NSCG will split that question into two, 
first asking respondents to specify their sex 
assigned at birth and then their current gen- 
der identity. For the latter, respondents will 
have the option of selecting all that apply 
among male, female, and transgender; typ- 
ing their own descriptor as free text; or both. 
This approach is consistent with OMB rec- 
ommendations released this month detailing 
how to collect gender identity data, though 
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some think the survey should offer more op- 
tions, such as nonbinary and gender-fluid. 
Gender is “quite an expansive spectrum,” 
says Abby Ray, a microbiology Ph.D. student 
at the University of California, Davis, and vice 
president of communications for oSTEM, an 
organization that represents LGBTQ+ people 
in STEM. For oSTEM’s own data collection 
efforts, “We try to give as many options as 
possible so people have a chance to see them- 
selves represented,” they say. 

The addition of a gender identity question 


is a “small step in the right direction,’ says 
Kaitlin Rasmussen, a postdoc in astrophysics 
at the University of Washington, Seattle, who 
has written on collecting data about margin- 
alized genders. But it’s frustrating that sexual 
minorities won’t be counted as well, they add. 
“Gender and sexuality are really deeply inter- 
twined and you might say you can’t consider 
one without the other.” 

NSF told OMB it decided to omit the 
question about sexual orientation because it 
didn’t perform well on the pilot test, taking 
longer to complete than the gender identity 
question and resulting in more changed an- 
swers and respondents exiting the survey. 
But the agency provided no data to back 
up those assertions or show how the results 
compare with those for questions about, for 


instance, race, disability, or income. “If they 
really want to stick to what they’re claiming, 
then they should release the data,” says Jon 
Freeman, an associate professor of psycho- 
logy at Columbia University who has spent 
the past 5 years advocating that NSF collect 
data on LGBTQ scientists. An NSF spokes- 
person wrote in a statement to Science 
that the agency “will make those and other 
methodological findings available to the pub- 
lic later this year.’ 

It’s “perplexing,” says Nancy Bates, a re- 
tired federal statistician. She points 
out that questions about sexual ori- 
entation have been asked in other 
federal surveys for years and have 
generally performed well. “There’s 
less debate and sort of worry about 
measurement error for that particu- 
lar question, as opposed to gender 
identity,” says Bates, who co-chaired a 
National Academies of Sciences, Engi- 
neering, and Medicine working group 
that issued recommendations in 2022 
detailing best practices for measuring 
sexual orientation and gender identity. 

The NSF spokesperson said the 

agency will continue to explore whether 
to add a sexual orientation question to 
future workforce surveys. “The findings 
[from the 2021 pilot test] are already 
helping inform questions for future 
surveys. However, the results indicated 
that additional research and testing are 
needed to ensure it results in accurate 
data that protects privacy,’ the spokes- 
person added. 

Freeman hopes OMB will force NSF’s 
hand. In an effort to sway OMB’s decision, 
due next month, he drafted a 16-page letter 
detailing his concerns about NSF’s plans, 
which has been cosigned by 1700 other re- 
searchers. The letter, submitted to OMB last 
week, argues that NSF hasn’t been transpar- 
ent about its decision-making process and 
that the agency could still move forward with 
a sexual orientation question by borrowing 
methodological approaches that have already 
been extensively tested and vetted by other 
federal agencies, such as the Department of 
Education, the Census Bureau, and the Cen- 
ters for Disease Control and Prevention. 

“[LOMB] could say, there’s no reason that 
youre not also doing sexual orientation 
when that’s what all these other agencies 
have done,” Freeman says. & 
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A cell-by-cell view of a worm’s metamorphosis 


Single-cell analysis reveals shifts in identity as shapeless larva becomes wriggling adult 


By Elizabeth Pennisi, in Austin, Texas 


newborn infant looks unmistakably 

human, with legs, mouth, ears, and 

bottom all in place. The same can’t 

be said about the youngest sea stars, 

worms, or butterflies: Many inverte- 

brates start out looking nothing like 
the adults they will become. Now, research- 
ers have monitored one worm’s larval cells 
during the transformation to adulthood, 
spying on their fates and how their identi- 
ties changed. The work, reported earlier this 
month at the annual meeting of the Society 
for Integrative and Comparative Biology 
here, provides some of the first clear cell- 
by-cell clues about what happens during 
these radical changes in body form. 

“We are beginning to get at how a 
larva produces an adult body plan,’ says 
Christina Zakas, an evolutionary bio- 
logist at North Carolina State University 
(NCSU) who was not involved with the 
work. The study’s reliance on a sophisti- 
cated technique for tracking gene activ- 
ity in single cells is also a milestone. To 
date, most single-cell analyses have fo- 
cused on well-studied “model” animals 
such as the fruit fly, mouse, or zebra- 
fish. “That [the author] got this new 
technology to work on something that 
was not a model organism was just 
stunning,” says Greg Rouse, a marine 
biologist at the Scripps Institution of 
Oceanography who was not involved 
with the work. The work could inspire 
other biologists to study their own fa- 
vorite metamorphosis at the level of 
single cells, he says. 

Invertebrates undergo several kinds 
of larva-to-adult transformations. Cat- 
erpillars become butterflies through 
a well-understood process called cata- 
strophic metamorphosis, in which a 
small group of cells that were dormant 
in the larva multiply and diversify into 
the adult, leaving the dead larval body 
behind. But Paul Bump, now a post- 
doctoral fellow at Harvard University, 
wanted to know what happens in spe- 
cies lacking such a group of cells. 

Bump, who was then a Stanford Uni- 
versity graduate student working with 
evolutionary biologist Christopher Lowe, 
focused on the California broken-hearted 
worm (Schizocardium californicum), a 
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marine worm found in California mud flats. 
The 30-centimeter-long adult grows out of a 
larva that is little more than a sesame seed- 
size blob with an eyespot at one end and a 
band of cilia around its body. Several months 
after starting life, in just 48 hours or so, the 
larva transforms into a tiny juvenile worm 
with bulging mouthparts, a thickened mid- 
section, and a long, sinuous body. 

To track what happens to the larva’s cells 
as it transitions to adulthood, Bump and 
his colleagues first sequenced the worm’s 
genome and figured out how to tease out 
individual cells. He measured gene activ- 
ity in cells isolated from larvae at different 


Muscle cells (blue, top) in the larva of the California broken- 
hearted worm are one of many cell types rearranged during 
the metamorphosis that produces an adult worm (bottom). 


stages of metamorphosis and from the juve- 
nile worm. Using those results, he grouped 
the larval and juvenile cells into different 
types—nerve cells, muscle cells, and the 
like. He also attached labels to DNA at dif- 
ferent time points so he could see where 
it—and the cell containing it—wound up as 
metamorphosis proceeded. 

The method could even identify cells that 
formed during metamorphosis. The proto- 
col is “a new way of asking questions about 
cell identities,” says Caroline Albertin, a 
developmental biologist at the Marine Bio- 
logical Laboratory who was not involved 
with the work. 

Depending on the cell type, one of 
three things happened. Muscle cells and 
some other cell types survived the trans- 
formation with little change. Other larval 
cells, including a few nerve cells, died off 
and disappeared. But to Bump’s surprise, 
many cells—perhaps almost half, in- 
cluding other nerve cells and gut cells— 
seemed to hang around but switched 
their repertoire of active genes, taking on 
different roles in the adult. “What a cell 
is, and is capable of, is more flexible than 
we previously appreciated,” he said at the 
meeting. “It seems that there are differ- 
ent cell types at different developmental 
stages,’ says Jose Aguilar, an evolution- 
ary biologist at NCSU who hopes to use 
the single-cell technique in his own work 
on other worms. 

Bump also noted clues to the evolu- 
tionary roots of metamorphosis. Around 
the time the worm’s larva began its trans- 
formation, many cells started to produce 
an enzyme that, in insects, helps activate 
a molecule called juvenile hormone. The 
worm appears to make a related hor- 
mone, Bump reported, and the timing 
suggests the hormone plays a role in the 
worm’s metamorphosis. Because juvenile 
hormone is known to regulate metamor- 
phosis in insects, the finding could mean 
that some of the molecular mechanisms 
controlling the process evolved in a com- 
mon ancestor of worms and insects. 

Elaine Seaver, a developmental bio- 
logist at the University of Florida, sug- 
gests more insights may be forthcoming. 
“We're starting to apply single-cell se- 
quencing to all kinds of animals. It’s a 
powerful tool to learn more about evo- 
lution and about life histories.” 
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Now in charge, House 
Republicans ramp up probes 


New investigative panels will delve into 
pandemic origins and research ties with China 


By Jeffrey Mervis and Jon Cohen 


he new Republican majority in the 

U.S. House of Representatives is 

preparing to shine a bright light on 

science—and _ scientists. Last week, 

it created two investigative panels 

that will scrutinize the country’s rela- 
tionship with China and its response to the 
COVID-19 pandemic. Both committees are 
expected to grill many prominent scientists 
and federal research officials on their actions 
over the past several years. 

In approach and style, however, they are 
likely to be very different. The China panel, 
officially the Select Committee on Strategic 
Competition between the United States and 
the Chinese Communist Party, is fueled by 
bipartisan concerns that China has used eco- 
nomic espionage to help it become a global 
leader in key high-tech fields, including en- 
ergy storage, artificial intelligence, and quan- 
tum computing. “It’s time to understand the 
urgency of the threat [posed by China] ... by 
working on a bipartisan basis,” says its new 
chair, Representative Mike Gallagher (R-WI), 
a former intelligence officer. 

In contrast, the Select Subcommittee on 
the Coronavirus Pandemic is widely regarded 
as an opportunity for House Republicans to 
criticize President Joe Biden’s administra- 
tion’s response to the global crisis. Approved 
on a party line vote of 220 to 213 as part of a 
package of new House rules, the panel is also 
expected to focus on claims, unsupported by 
evidence, that a laboratory in Wuhan, China, 
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either bioengineered or accidentally released 
SARS-CoV-2 obtained from bats. 

The China committee will build on previ- 
ous congressional reports that have ques- 
tioned the value of continued scientific 
collaboration with China despite the many 
students and scientists born in China who 
conduct research on U.S. campuses and 
run high-tech companies. The 365-to-65 
House vote to create the panel points to a bi- 
partisan consensus that U.S. higher educa- 
tion can do more to protect federally funded 
research from foreign influences, without 
sacrificing the traditional free flow of talent 
and ideas across national borders. 

Although a majority of House Democrats 
voted to create the panel, some worry it may 
be a way to revive the now-canceled China 
Initiative. Under former President Donald 
Trump’s administration that law enforce- 
ment initiative led to the criminal prosecu- 
tions of some two dozen academic scientists 
of Chinese ancestry. 

“We have always recognized that there 
are legitimate concerns with the actions of 
the [Chinese] government,’ said Represen- 
tative Judy Chu (D-CA), chair of the Con- 
gressional Asian Pacific American Caucus, 
who opposed setting up the special panel. 
“But this committee should not be used as 
an open invitation to engage in blatantly 
xenophobic anti-China rhetoric ... and to 
promote policies that result in the racial 
profiling of our communities.” 

Ties with China will be a focus of the 
pandemic panel as well. The Wuhan lab re- 


Representative James Comer (R-KY) will play 
a key role in the COVID-19 pandemic probe. 


ceived some U.S. government funding, lead- 
ing to charges that U.S. officials squelched 
concerns about the lab and attempted to 
mislead the public about the pandemic’s or- 
igins. Representative James Comer (R-KY), 
chair of the House oversight panel that will 
administer the select committee, has said it 
will investigate “what the U.S. government 
knew regarding the origins of COVID-19 
and when the government knew it.” 

The pandemic panel’s chair has yet to be 
named, but it is expected to aggressively 
pursue requests, sent last month by Comer 
and Representative Jim Jordan (R-OH), for 
detailed information from more than 40 aca- 
demic scientists and government officials. It 
also is likely to examine how the U.S. govern- 
ment regulates and funds “gain-of-function” 
research, which can make viruses more dan- 
gerous to humans in order to better under- 
stand the risks they pose and devise better 
vaccines and treatments. 

One high-profile target will be immuno- 
logist Anthony Fauci, former head of the 
National Institute of Allergy and Infectious 
Diseases (NIAID). Other targets include 
conservation biologist Peter Daszak, presi- 
dent of the EcoHealth Alliance, a nonprofit 
that subcontracted a grant from NIAID to 
the Wuhan laboratory, and four scientists 
involved in producing a 2020 Nature Medi- 
cine paper that concluded SARS-CoV-2 was 
not engineered in a laboratory. They are 
Kristian Andersen of Scripps Research, 
Michael Farzan of UF Scripps Biomedical 
Research, Robert Garry of Tulane Univer- 
sity, and Ian Lipkin of Columbia University. 

Comer has alleged that “Fauci was warned 
early on that the virus appeared manmade 
and pointed to a lab leak and instead of 
blowing the whistle may have attempted to 
cover it up.” Fauci has rejected allegations 
of any cover-up and says he is willing to an- 
swer questions from Congress. EcoHealth 
issued a statement saying it has “cooper- 
ated with every bipartisan initiative that has 
contacted us” and told Comer and Jordan 
it will “assist” lawmakers in their effort “to 
understand the scientific evidence behind 
COVID-19’s origins.” 

Other researchers contacted by Comer and 
Jordan said they are reviewing the requests 
to provide a wide array of “documents and 
communications” related to the Nature Medi- 
cine paper and other issues. 

Neither select panel has the power to 
write or approve legislation. And any pol- 
icy recommendations would have to win 
enough votes in the Democrat-controlled 
Senate and avoid a presidential veto to be- 
come law. & 
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Richard Gill is fighting the shoddy statistics 
that put nurses in prison for serial murder 


By Cathleen O’Grady, in Leiden, the Netherlands 


hen a Dutch nurse named Lucia de 
Berk stood trial for serial murder 
in 2003, statistician Richard Gill 
was aware of the case. But he saw 
no reason to stick his nose into it. 
De Berk was a pediatric nurse 
at Juliana Children’s Hospital in The Hague. 
In 2001, after a baby died while she was on 
duty, a colleague told superiors that De Berk 
had been present at a suspiciously high num- 
ber of deaths and resuscitations. Hospital 
staff immediately informed the police. When 
investigators reexamined records from De 
Berk’s shifts, they found 10 suspicious inci- 
dents. Three other hospitals where De Berk 
had previously worked added another 10. 
The probability of such a pattern happen- 
ing by chance was one in 7 billion, the police 
said. De Berk was arrested on 13 December 
2001, suspected of murdering five children. 
Newspapers called her a “murder nurse” and 
an “angel of death.” 

Gill, then working as a statistics profes- 
sor at Leiden University, remembers his 
wife telling him about a “witch trial” and 
saying, “They’re using statistics; you should 
get involved, do something useful.” But Gill 
knew the statistician working on the case 
and considered him a decent, careful person. 
“So I thought I didn’t have to. And anyway, 
I was obsessed with quantum mechanics,” 
he says. In 2003, De Berk was found guilty 
of four murders and three attempted mur- 
ders and sentenced to life in prison. An ap- 
peals court convicted her again in 2004. The 
Dutch Supreme Court upheld the conviction 
2 years later. 

It wasn’t until late 2006, when Gill read 
two whistleblowers’ account of the trial, that 


228 20 JANUARY 2022 * VOL 379 ISSUE 6629 


he started to look into the case—and became 
incandescent. Tunnel vision, bad statistics, 
and poor human intuitions about coincidence 
had marred the investigation. When Gill ran 
the numbers himself, he found the string of 
deaths on De Berk’s watch might well be en- 
tirely due to coincidence. Along with fellow 
statisticians, whistleblowers, and others, Gill 
campaigned for a retrial that eventually led 
to De Berk’s exoneration in 2010. Her case 
is now considered one of the worst miscar- 
riages of justice in the Netherlands. 

It also opened a new chapter in Gill’s pro- 
fessional life: He became a leading expert on 
the statistics of medical murder cases simi- 
lar to De Berk’s—and a loud, persistent voice 
warning of the shoddy statistics that are 
sometimes central to prosecutors’ arguments. 
“In a normal murder case, you actually have 
a body which has clearly been murdered,’ he 
says. When there’s only a suspicious cluster of 
deaths, investigators may assume a murderer 
is at work and selectively focus on evidence 
that supports that assumption. People’s intu- 
ition of an “impossible coincidence” joins the 
dots in the evidence. 

Gill worked with defense lawyers and 
campaigned—in vain—to overturn the con- 
viction of British nurse Ben Geen, found 
guilty in 2006 of two murders and 15 counts 
of grievous bodily harm. He also helped 
secure the October 2021 acquittal of nurse 
Daniela Poggiali, accused of two murders in 
a high-profile case in Italy. By now, the mis- 
use of statistics has drawn enough attention 
that prosecutors sometimes insist their evi- 
dence is not statistical, Gill says, but often, 
“hidden statistics” seep into the cracks. 

In a report peer reviewed and distributed 


by the Royal Statistical Society (RSS) in Sep- 
tember 2022, Gill and colleagues detailed 
the statistical missteps in past medical 
murder trials and made recommendations 
for how legal systems can do better. Gill 
hopes the report will help with the case 
of another British nurse, Lucy Letby, who 
is now on trial for the alleged murder of 
seven babies and attempted murder of 
10 more in a neonatal unit at the Countess of 
Chester Hospital. 

“Similar issues have arisen across many, 
many different jurisdictions,” says crimino- 
logist William Thompson, professor emeri- 
tus at the University of California, Irvine, 
and a co-author of the RSS report. “The same 
investigative dynamics play out ... the same 
cognitive biases, and the tunnel vision.” Gill 
likes to point out such errors with an outspo- 
kenness that frequently ruffles feathers, says 
statistician Peter Griinwald of the Center 
for Mathematics and Informatics (CWI) in 
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Amsterdam, a friend and colleague who also 
campaigned for De Berk’s retrial. “He will 
give very radical opinions. ... But somehow 
he’s a very pleasant person to disagree with.” 


GILL HAS NOT ALWAYS been a troublemaker. 
His career has been defined by long detours 
into the bowels of arcane mathematical 
problems. Injustice bothers him—but so does 
error. He spends a lot of time debating quan- 
tum mechanics “crackpots” on the internet. 

Gill had a serene childhood in the English 
countryside. His father, a physicist, spent his 
career in industry. His mother, Gill discov- 
ered after World War II intelligence was de- 
classified in 1974, had been one of the human 
“computers” who helped crack Germany’s 
Enigma code at an outstation of Bletchley 
Park. “I wasn’t surprised,” he says. “I always 
thought I got my brains as much from her as 
from my father.” 

At the University of Cambridge, where 
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he studied math, it was statistics that most 
captured his attention. It had “weighty ethi- 
cal and philosophical implications,’ he says. 
“Tt was a branch of mathematics that really 
means something.” 

As a student, he was not much of an activ- 
ist. He says he feels guilty about not speaking 
up more about injustice when he was young. 
One incident in particular haunts him: his 
role as a Statistician in a 1970s experiment 
that severed the front legs of rats to inves- 
tigate whether bipedalism reshaped their 
skulls. “What upset me most is that I didn’t 
have the strength of character to refuse to do 
that job.” 

The study was one of Gill’s first assign- 
ments as a Statistical consultant at a prede- 
cessor to CWI. Newly married, with three 
children born in quick succession, his most 
pressing concern was finding and keeping a 
good job, and the variety of consulting proj- 
ects at the center fit neatly with his desire to 


do something practical. He obtained a Ph.D. 
on the mathematical underpinnings of “sur- 
vival analysis,” the study of the expected time 
until an event—such as a mechanical failure 
or a death in a clinical trial—occurs. Later, 
statistical problems in quantum mechanics 
were his main focus. 

And then he started to look into the story 
of Lucia de Berk. 


DE BERK’S CASE became famous for a num- 
ber: one in 342 million. That was the prob- 
ability that the many “incidents” on her 
shifts were due to random bad luck, accord- 
ing to Henk Elffers, a law psychologist then 
at the Netherlands Institute for Crime and 
Law Enforcement and an expert witness for 
the prosecution. His figure was less stark 
than the police’s one in 7 billion, but still 
very damning. 

Elffers’s reasoning was controversial and 
came under fire from statistical experts 
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Lucia de Berk (facing camera) embraces Metta de Noo, who played a key role in her defense, after her exoneration in 2010. De Berk spent more than 6 years in prison. 


during De Berk’s appeals. He had multi- 
plied the probability of De Berk’s pattern of 
death across multiple wards, which would 
make any nurse look guiltier with each job 
change. For example, even a mundane one in 
20 chance at one hospital, and the same 
chance at the next, would transform into a 
more suspicious one in 400 chance. 

But prosecutors had additional evidence: 
Investigators had found traces of the heart 
medication digoxin in the body of one al- 
leged victim and an overdose of the sedative 
chloral hydrate in another. At De Berk’s first 
appeal—essentially a retrial—the Court of 
Appeal in The Hague ruled this evidence of 
foul play meant other deaths could be safely 
attributed to De Berk based on weaker evi- 
dence, including the overall “pattern” of in- 
cidents, and her diary, which spoke of her 
“very great secret” and “compulsion.” The 
appeals court convicted De Berk again, add- 
ing three additional murders to her count. 

De Berk, who suffered a stroke 5 days 
after the Supreme Court rejected her 
second appeal in 2006, maintained her 
innocence throughout. 

That might have been the end of the case if 
it hadn’t been for Metta de Noo, a geriatrician 
who had inside information. De Noo’s sister- 
in-law was the head pediatrician at Juliana, 
where De Berk worked, and had aided the 
police investigation. But when De Noo ex- 
amined documents from the case, she found 
what she believed were flaws in the medical 
evidence. The infant who had allegedly died 
of digoxin poisoning had been declining for 
days after heart surgery. And the hospital had 
prescribed the maximum dose of chloral hy- 
drate for the other child, allowing additional 
doses if needed. De Berk had been agitating 
for doctors to pay attention to the child’s de- 
teriorating condition. 
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When De Noo asked specialists for sup- 
port, she met with hostility and ridicule. 
Her doggedness destroyed her good rela- 
tionship with her brother and his wife. She 
eventually turned to Ton Derksen, her other 
brother and a philosopher of science who 
had spent his career writing about flaws in 
reasoning of the type that permeated the De 
Berk investigation. 

With De Noo’s help, Derksen published 
a bombshell book in 2006: Lucia de B.: Re- 
construction of a Miscarriage of Justice. (In 
the Netherlands, suspects’ last names are 
commonly withheld to protect their privacy.) 
Derksen dismantled the figure of one in 
342 million, giving a meticulous account of 
statistical errors, weak medical evidence, and 


“We humans are terribly good 
at seeing patterns 
when they’re not there.” 


Peter Green, University of Bristol 


bias in the investigation. For example, inves- 
tigators examining the “incidents” connected 
with De Berk had classified deaths and resus- 
citations as suspicious when she was on duty, 
and not suspicious when she was off. 

The prosecution had also argued that De 
Berk’s ward had seen a total of five deaths be- 
tween 1996 and 2001, and all had occurred 
after De Berk had started working in 1999. 
But the ward had a different name until 1999, 
and earlier deaths were excluded, Derksen 
found. In reality, there were seven deaths in 
the 3 years before De Berk joined and six in 
the 3 years after. (De Noo published her own 
account of the case—and the way it tore her 
family apart—in 2010.) 


Griinwald, then a young assistant pro- 
fessor, brought Derksen’s book to Gill’s at- 
tention and asked whether he would join a 
campaign for De Berk’s case to be reopened. 
Gill says reading the book made him “ab- 
solutely furious” with himself for trusting 
Elffers and not getting involved earlier. And 
he was angry that the appeals court had 
claimed its verdict did not rely on statistics: 
“Ton Derksen showed that it was soaked 
in statistics.” 

Gill quickly reanalyzed the data himself. In 
a write-up posted online in January 2007, he 
reported a much less outlandish probability 
of one in 100,000—even before removing bi- 
ases in the data. Gill has refined his analysis 
over the years, building in complexities such 
as the fact that nurses could be expected to 
have different mortality rates based on their 
skill, choices, and work patterns. In a paper 
in Chance in 2018, he and colleagues calcu- 
lated a probability of one in 49. 

In 2007, convinced of De Berk’s innocence, 
Gill organized a petition to reopen the case. 
His quantum mechanics work was “useful af- 
ter all,’ he says, because he persuaded Nobel 
Prize-winning physicist Gerard ‘t Hooft to 
sign, which generated headlines. But in other 
ways Gill was less diplomatic. He called some 
doctors “criminals” and said “outrageous 
things” to journalists, Griinwald says: “Metta, 
Ton, and I basically had to hold him back.” 
Haga Hospital even threatened to sue him af- 
ter he posted previously unpublished details 
about the case on his website. 

Yet Griinwald says Gill’s cheerful fearless- 
ness was crucial. Many Dutch statisticians 
knew and liked Elffers, he says. “People 
... were afraid to say out loud that he was 
doing something stupid and nonsensical. 
Richard had no problems with that at all.’ 
(Elffers declined to comment for this story.) 
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The efforts paid off. In 2006, the Commis- 
sion for the Evaluation of Closed Criminal 
Cases decided to reconsider the case and 
appointed a subcommittee to investigate. 
In a “drab government building” in The 
Hague, Gill helped explain how bungled sta- 
tistics had put De Berk in prison. In 2007, 
the commission recommended reopening 
the case; in 2008, the Dutch Supreme Court 
agreed. That same year, the Dutch govern- 
ment suspended De Berk’s sentence and she 
was released from prison, pending a retrial. 


THE MISTAKES in De Berk’s case were far 
from unique, Gill and others say. “We hu- 
mans are terribly good at seeing patterns 
when they’re not there,” says statistician 
Peter Green, a professor emeritus at the 
University of Bristol and one of the RSS 
report’s authors. 

Investigators sometimes enhance those 
patterns by only tallying the evidence that 
confirms their theory, discarding or not 
even noticing data that don’t. Even inves- 
tigators who aim to be unbiased can make 
minor choices that add up to a skewed pic- 
ture, Thompson says (see graphic, right). 
“You end up with a piece of evidence that 
looks extraordinarily unlikely to have oc- 
curred by chance. And of course, the prob- 
lem is it didn’t exactly occur by chance—you 
kind of helped it along.” 

Gill worries this is what led to the 2006 
conviction of Geen, who was given 17 life 
sentences, with a minimum term of 30 years. 
Prosecutors argued there was a high rate 
of unexplained respiratory arrests—which 
are typically rarer than cardiorespiratory 
arrests—on Geen’s shifts, although they did 
not try to quantify the probability that this 
“unusual pattern” occurred by chance. As in 
De Berk’s case, there was other evidence, in- 
cluding the fact that Geen had a syringe con- 
taining muscle relaxant in his pocket when 
he was arrested. The prosecution argued that 
he had injected patients with the drug in or- 
der to cause respiratory arrest and then play 
the hero by resuscitating them. 

Geen’s defense lawyers challenged the “un- 
usual pattern” in a 2009 appeal, submitting 
a report by University of Warwick medical 
statistician Jane Hutton. The appeals court 
upheld the conviction. “The judges seemed 
to be very overconfident that they could de- 
tect an unusual pattern without putting in 
some of the most basic information that you 
need as a comparison,” Hutton says. 

In a 2022 paper published in Laws, Gill 
and colleagues argued that blinded inves- 
tigators might have reached different con- 
clusions about Geen’s case. The high rate 
of respiratory arrests on his shifts was ac- 
companied by a drop in cardiorespiratory 
arrests, suggesting a bias in how these cases 
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A murderous nurse at work—or just coincidence? 

Cognitive biases can easily lead an investigation astray and have drastic effects on how suspicious a cluster 

of deaths seems. In this imaginary example drawn from real-world errors, a doctor reports that many deaths 
seem to occur while Nurse X is on duty. The hospital launches an investigation, reexamining deaths at Nurse 
X's ward over the past 2 years. A simple statistical test* compares the rate of suspicious deaths when Nurse 
X is on duty with the rate when she is off. It then calculates the probability of seeing this pattern purely as a 


result of random chance.** The outcome depends greatly on the type of investigation. 


Type of investigation 


Severely biased 


@ Deaths with Nurse X on duty 


Other deaths 


Investigators look at deaths on Nurse X’s shifts and find nine that were medically 


unexpected. They judge that two more deaths should be added: one that occurred 


Deaths: 11 versus 0 
Probability: 1 in 83,000,000 


15 minutes after her shift ended, and another that a pathologist did not consider 
unexpected at the time, but now finds suspicious knowing Nurse X was present. 


e 
e @ ee e 
18 24 


Months 


Less biased 


Investigators reexamine all deaths over the past 2 years, not just those 


on Nurse X's shifts, finding five unexpected deaths when she was not present. 


Deaths: 11 versus 5 
Probability: 1 in 10,400 


Blinded 


The patterns looks far less suspicious now. 


Investigators reexamine deaths without knowing which nurse was on duty. 


They do not assign to Nurse X the death that occurred after her shift ended 


Deaths: 9 versus 6 
Probability: 1 in 488 


Careful 


and do not reclassify the death considered normal at the time. 


Researchers look for other factors that could explain the pattern, such as changes 


in hospital policy. They find that Nurse X often worked the morning shift, when 


Deaths: 7(a.m.) 2(p.m.) versus 
2(a.m.) 4(p.m.) 


Probability: 1 in 45 


hospitalized patients are known to die more often than at other times of the day. 
Nurse X’s deaths now look even less suspicious. 


* Log-linear model and chi-sq test. Method adapted from Appendix 5 in “Healthcare serial killer or coincidence? Statistical issues in investigation 
of suspected medical misconduct,” published by the Royal Statistical Society in 2022. Script available at https://fal.cn/UnluckyNumbers. 
** The p-value gives the probability of finding these results—or more extreme ones—if the hypothesis that the nurse killed the patients is not true. 


were classified. Compared with data from 
the same hospital over a wider time period, 
the deaths and resuscitations on Geen’s 
shifts do not seem extraordinary, Gill and 
his co-authors said. He and other statisti- 
cians wrote letters of support in 2015 when 
Geen asked the Criminal Cases Review Com- 
mission to look into his case. The request was 
denied; Geen remains in prison. 


EVEN WHEN statistical experts do get in- 
volved in a case, they may not be immune to 
errors of reasoning, as Elffers’s work showed. 
In the case of Poggiali, the Italian nurse, 
statisticians wrote that a very high level of 
statistical significance is a “guarantee” that 


“there is a causal effect”—in this case between 
Poggiali being on duty and the deaths. But 
this is a well-known error of reasoning: 
“Correlation is not causation,’ Green says. 
Thompson says clusters may have surpris- 
ing causes that are difficult or impossible to 
uncover. He points to cases where chemicals 
leached from equipment or changes in baby 
formula were at fault. 

Gill and his colleagues found that 
Poggiali’s death rate was higher than her col- 
leagues, even after various controls, but ar- 
gued this could be at least partly explained by 
Poggiali’s long hours—she arrived very early 
and left late from her shifts—which meant 
she was present at more death certifications 
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Mothers damned by statistics 


ext month, a judge in Sydney will 

hear new expert testimony ina 

criminal case that has fascinated 

Australia for 2 decades: that of 

Kathleen Folbigg, who in 2003 was 
convicted of the murder of three of her 
infant children and manslaughter in the 
death of the fourth. 

There is no medical evidence that 
Folbigg’s children were murdered. Her 
case rests partly on the vanishingly 
small chance that unexplained medical 
tragedy would strike the same family 
four times. Like some other infanticide 
cases, it parallels the murder convic- 
tions of doctors and nurses based on 
suspicious clusters of patient deaths 
(see main story, p. 228). As those cases 
show, seemingly common-sense sta- 
tistical assumptions can mislead—with 
horrifying consequences. 

Folbigg’s children all died between 
1989 and 1999, at ages between 
19 days and 19 months. Her husband 
reported Folbigg to the police after 
discovering her diary, in which she had 
described anger and frustration with 
her children, and a sense of responsi- 
bility for their deaths: “With Sarah all | 
wanted was her to shut up. And one day 
she did.” 

For each child, doctors found pos- 
sible, but not definitive, evidence for 
natural causes of death. Yet taken 
together, expert witnesses said, the 
deaths were suspicious, because mul- 
tiple cases of sudden infant death syn- 
drome (SIDS) within a single family are 
extremely rare—let alone four of them. 
The New South Wales Supreme Court 


sentenced Folbigg to 40 years in prison, 


reduced to 30 years by a 2005 appeal. 
A 2019 inquiry upheld her conviction, 
and a 2021 appeal was dismissed. 

Critics say the case rested heavily on 
the reasoning popularized in the 1990s 
by British pediatrician Roy Meadow, 
who asserted that with respect to 
child deaths, “one is a tragedy, two is 
suspicious, and three is murder unless 
there is proof to the contrary.” Pedia- 
trician Susan Beal cited a variation of 
“Meadow’s law” during a 2003 hearing 
on what evidence could be admitted in 
Folbigg’s trial. 

Meadow testified in court cases him- 
self as well. But his reputation fell apart 
after the case of British solicitor Sally 
Clark, who in 1999 was convicted of 
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murdering her two infant sons. Meadow 
testified that the chance of two SIDS 
deaths in a low-risk family like Clark’s 
was one in 73 million. That calculation 
assumed SIDS could not have inherited 
risk factors, statistician Phil Dawid of 
the University of Cambridge wrote ina 
report for Clark's first appeal in 2000. 
He put the chance of the two deaths 

at a less outlandish one in 1 million, 

“or even much higher,” and pointed out 
that double infanticide is also vanish- 
ingly rare. The court should weigh both 
rare possibilities against each other, he 
says, along with all the other evidence. 

Clark lost the appeal, but she was 
exonerated at a second appeal in 2003, 
partly because it came to light that 
pathologist Alan Williams had failed 
to disclose evidence that one of the 
babies had Staphylococcus aureus 
in his spinal fluid, a possible natural 
cause of death. The appeal judges said 
Meadow’'s statistical evidence—which 
could have had “a major effect” on the 
jury—should not have been admitted. 

Williams was barred from working 
for the U.K. Home Office for 3 years and 
Meadow lost his medical license, a deci- 
sion later overturned by the U.K. High 
Court. After the scandal, the attorney 
general ordered a review of 297 infanti- 
cide cases, and decided to drop charges 
in three cases and review the convic- 
tions in 28 others. 

There may be exculpatory medical 
evidence in Folbigg’s case as well. In 
2020, a group of researchers led by 
Peter Schwartz at the Italian Auxo- 
logical Institute published a paper 
showing Folbigg’s two daughters both 
had a newly discovered genetic variant 
that impairs cells’ ability to regulate 
calcium, leading to a greatly increased 
risk of cardiac arrhythmia and sudden 
death. The paper led the Australian 
Academy of Science and Folbigg’s 
lawyers to launch a petition in March 
2021, signed by 90 scientists, asking 
New South Wales Governor Margaret 
Beazley to pardon Folbigg. Beazley 
ordered a new inquiry; hearings are due 
to begin in February. 

Clark, despite her vindication, never 
recovered and died of acute alcohol 
poisoning in 2007. Her family and the 
coroner's office attributed the death to 
severe distress from the “catastrophic 
experience.” —C.0.G. 


during shift handovers. They also pointed out 
a statistical flaw in the medical evidence: A 
toxicologist had said the potassium concen- 
tration found in one of the victim’s eyes 
was unexpectedly high, suggesting potas- 
sium chloride poisoning. But this did not 
take into account any statistical uncer- 
tainty in the data on expected levels of 
potassium, Gill and colleagues wrote in a 
2021 paper in Law, Probability and Risk 
summarizing the findings that had helped 
secure Poggiali’s acquittal. 

The Letby case now in court shows many 
of the same troubling features as earlier 
cases, Gill and others say. Letby was moved 
to clerical duties in 2016 after a series of 
deaths and resuscitations on her shifts, 
and first arrested in 2018. She is accused 
of murdering seven babies and attempt- 
ing to murder 10 more, using methods 
such as insulin poisoning and injection of 
air bubbles. 

The similarities go beyond statistics 
to the way Letby has been vilified. Social 
media commentary will “make your stom- 
ach turn,” Gill says. “People are saying 
we should bring back hanging, shoot the 
bitch.” The media have portrayed her as 
an “evil creature,” says Neil Mackenzie, a 
lawyer based in Edinburgh, Scotland, who 
specializes in medical negligence cases and 
co-authored the RSS report. “I think there’s 
possibly misogyny in there,’ Mackenzie 
says. “The press loves bad women.” 

The RSS report Gill and others published 
in September does not claim Letby is inno- 
cent, in part because public comment on the 
guilt or innocence of a person standing trial 
may be considered contempt of court in U.K. 
legal systems. “We’ve got to have no opinion 
on this case,’ Green says, but “there’s poten- 
tial here for miscarriage of justice.” 

Gill says a deep cognitive bias works 
against defendants like Letby. People 
“don’t believe in chance, actually,’ he says. 
“Quantum mechanics has been shouting at 
us for 100 years that the physical universe 
is built on randomness. ... But we don’t un- 
derstand this. It upsets us deeply. When a 
succession of bad things happens, we know 
there must have been an agent responsible. 
And so we naturally believe in devils and 
witches, gods and angels.” 


NOT ALL MEDICAL MURDER cases are witch 
hunts, however. “This is an instance 
where there actually are some witches,” 
Thompson says. 

In 2000, for example, a British physician 
named Harold Shipman was convicted of 
murdering 15 patients over a period of 
3 years after an investigation yielded evi- 
dence that he had given overdoses of di- 
amorphine—heroin, used in the United 
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Kingdom for severe pain—and _ falsified 
the medical records of numerous patients, 
suggesting they had been sicker than they 
were to make their deaths appear less sus- 
picious. (Shipman was in one patient’s will 
but his motives have not become clear.) 
Shipman, who is suspected of killing hun- 
dreds more, was sentenced to life in prison 
and died by suicide in 2004. 

A 5-year government inquiry in the wake 
of the case identified ways to better protect 
patients, such as more oversight of death 
certificates. The case also led statisticians to 
explore a new question: Could statistics de- 
tect real murderers, based on a suspicious 
pattern of deaths alone? 

Cambridge statistician David Spiegelhalter, 
who gave advice to the panel, believes so. 
He and his colleagues adapted a method 
from industrial quality control to com- 
pare the rate of death certificates signed 
by Shipman over time with deaths at other 
local doctors’ practices. They found they 
could have identified a worrying pattern 
in Shipman’s patients 13 years before he 
was arrested. 

Such a system would produce false 
alarms; more people might die under the 
care of a doctor or nurse who works with 
particularly difficult cases, for example. 
But a robust method would prevent too 
many misfires, Spiegelhalter says, and a 
“ping” in the system should never be taken 
as anything more than a sign that a human 
should look at the data. 

But implementing this kind of routine 
monitoring would be very complicated, 
says Bruce Guthrie, professor of general 
practice at the University of Edinburgh. 
The kind of data Spiegelhalter and his col- 
leagues used is not routinely collected— 
it was pieced together as part of the 
Shipman investigation. And Shipman 
worked alone, which few family doctors 
do; many patients are likely to see multiple 
doctors. Only the “most horrendously pro- 
lific murderers” would be likely to show 
up, Guthrie wrote in an email to Science. 

Meanwhile Thompson, Gill, and oth- 
ers are calling for cultural and institu- 
tional fixes to prevent unjust convictions. 
Many lawyers find statistics challenging, 
Mackenzie says. “This is one of the evils of 
the ‘two cultures’ myth,” he says: Some stu- 
dents are channeled into scientific subjects, 
and others into humanities, and “never the 
twain shall meet.” Niamh Nic Daeid, a fo- 
rensic science researcher at the University 
of Dundee, says she routinely encounters 
anxiety and resistance about statistics. Nic 
Daeid, Spiegelhalter, and others have pro- 
duced a range of statistics training materi- 
als, including an RSS “primer” and a free 
online course for lawyers. 
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But training is not enough, Thompson 
says, because the biases that underlie er- 
rors are “built into our perceptual pro- 
cesses.” Instead, he says, it’s crucial to 
change investigative procedures. The RSS 
report recommends that investigators be 
blinded. For example, pathologists should 
classify deaths as suspicious or not without 
knowing which medical personnel were 
in attendance, adapting the standardized 
blinding methods used in epidemiology to 
study disease outbreaks. 

But blinding has proved to be a hard 
sell among forensic scientists, in part be- 
cause it’s often more challenging than it 
seems, says Peter Stout, CEO of the Hous- 
ton Forensic Science Center and a strong 


GILL WAS AT THE COURT of Appeal in Arn- 
hem in 2010 when De Berk’s exoneration 
was announced. “It was one of the big- 
gest events of my life,” he says. “It was 
really joyful.” De Berk was immediately 
rushed off by her lawyers and journalists 
swarmed Derksen and De Noo. “I bought a 
marijuana cigarette,” Gill says, and then he 
took a train to The Hague and went to the 
beach. “I smoked my joint, and I ate a dish 
of oysters, and drank some white wine.” 
De Berk later received a written apology 
from the Dutch minister of justice and an 
undisclosed financial compensation for 
the 6.5 years she spent in prison. Gill stays 
in contact with her; she likes his posts on 
Facebook sometimes. She told Gill she did 


““Thisis an instance 
where there actually 


are Some witches, 


William Thompson, 
University of 
California, Irvine 


Graves were exhumed in Hyde, United Kingdom in 1998 to search for evidence against physician 
Harold Shipman, who was convicted of killing 15 patients in 2000. 


advocate of blinding and other measures 
to improve forensic science. It can mean, 
for example, that a forensics lab—already 
strapped for funding and time—needs an 
extra person to serve as a case manager 
who screens possibly biasing information 
from a blinded analyst. And the line be- 
tween relevant and irrelevant information 
is not always clear. 

Adele Quigley-McBride, a cognitive 
bias researcher at Duke University, trains 
analysts in a technique called sequential 
unmasking. The method gets around the 
blurred line between relevant and irrel- 
evant information by giving investigators 
access to increasing amounts of informa- 
tion with each round of analysis. Analysts 
note their observations and conclusions in 
each round; if new information changes 
their opinion, they have to explain why. 


not want to be interviewed for this story. 
“She’s managed to put it all far away, and 
she needs to keep it that way,” he says. 

Thirteen years on, Gill, now retired, is 
watching the Letby case closely, but his ob- 
session with forensic statistics has begun 
to subside. His retirement projects include 
a range of statistical kerfuffles with lower 
stakes, such as the rating of Dutch herring 
sellers. He has plenty of other things to oc- 
cupy his attention—winemaking, an ama- 
teur distillery, grandchildren. “I think I’ve 
reached the point where I want to spend 
more time in the forest picking mush- 
rooms, actually,” he says. 

He hopes younger statisticians will feel 
compelled to help when bad statistics lead 
to injustice, as he did. “I sensed that in the 
Lucia case, I could make a difference,” Gill 
says. “And that therefore I must.” 
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Light pollution is skyrocketing 


Data from citizen scientists reveal a worrying growth 
in light pollution over the past decade 


By Fabio Falchi*? and Salvador Bara'? 


rtificial lighting at night is contributing 
to a rapid increase in light pollution, 
as reported by Kyba et al. (1) on page 
265 of this issue. By analyzing tens of 
thousands of observations made by 
citizen scientists around the world 
over the past 12 years, the authors found that 
the dimmest stars in the night sky are being 
hidden by a 10% yearly increase in the sky 
background as a result of artificial light. This 
growth is difficult to discern with satellites 
now in operation because their detectors are 
blind to the blue light emitted by light-emit- 
ting diodes (LEDs), which are progressively 
replacing older lamps (such as high-intensity 
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discharge). The results of the study point to 
the need for satellites that can detect the vis- 
ible light in different color bands and make a 
strong case for reversing light pollution and 
its wide-ranging negative effects, from eco- 
logical to cultural to health-related. 

Kyba et al. studied the collection of obser- 
vations of the Globe at Night project (https:// 
www.globeatnight.org). More than 50,000 
observations were submitted by citizens—us- 
ing the naked human eye—to detect the dim- 
mest stars visible in a site. The data allowed 
the authors to determine the background 
brightness of the night sky. The brighter the 
night sky is, the brighter a star must be to be 
seen by the unassisted human eye. Analysis 
of data from the past dozen years (from 


2011 to 2022) show that the sky brightness 
resulting from artificial light is increasing 
exponentially in the world with an alarm- 
ing average of 10% each year—i.e., doubling 
in less than 8 years. This increase is much 
higher than estimates of the evolution of ar- 
tificial light emissions (~2% yearly) based on 
radiance measurements taken by the Defense 
Meteorological Satellite Program and Suomi 
National Polar-orbiting Partnership satel- 
lites (2, 3). Part of this discrepancy could be 
explained by the inability of these satellites 
to detect blue light emitted by LEDs, which 
started to be used outdoors ~10 years ago. 
These satellites are also unable to detect light 
emitted almost horizontally, such as that 
from ultrabright LED billboards and build- 
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ing facades. These weak points must be over- 
come by next-generation satellites equipped 
with both multiwavelength and multiangle 
monitoring capabilities (4, 5) that this global 
environmental problem (6) deserves. 

The study of Kyba et al. is biased in that 
most of the data from citizen scientists are 
from Europe and North America. This does 
not subtract value from the findings; on the 
contrary, they are reinforced because these 
continents show a constant or slightly de- 
creasing light pollution trend, as determined 
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Light pollution intrudes on the Kitt Peak National 
Observatory in Arizona, USA. 


by satellites (2). The other continents show 
an increasing light pollution trend with satel- 
lites, so a greater increase would be expected 
for those continents when citizen science 
data become available. 

Perhaps the most important message 
that the scientific community should glean 
from the Kyba et al. study is that light pol- 
lution is increasing, notwithstanding the 
countermeasures purportedly put in opera- 
tion to limit it. Looking at the International 
Space Station’s images and videos of Earth’s 
night hemisphere, people generally are only 
struck by the “beauty” of the city lights, as 
if they were lights on a Christmas tree. They 
do not perceive that these are images of pol- 
lution. It is like admiring the beauty of the 
rainbow colors that gasoline produces in 
water and not recognizing that it is chemi- 
cal pollution. Awareness must greatly in- 
crease for artificial light at night to be per- 
ceived not as an always-positive thing, but 
as the pollutant it really is. 

Most people associate artificial light with 
road safety and personal security, connec- 
tions that are not well supported by evidence 
(7, 8). Thus, year after year, more lights are in- 
stalled to brighten the night. However, there 
are notable negative consequences that are 
on the rise. Astronomers were quick to real- 
ize that night lights affected their research, 
and hence they moved observatories farther 
from cities. Unfortunately, as the use of out- 
door lights spreads, this strategy has become 
less effective. In addition, as the efficiency of 
light technology improves, more lighting is 
possible for the same amount of money. This 
is translating into a flourishing of new lights 
to illuminate roads, buildings, and natural 
landscapes; for digital billboards; and for 
garden “beautification,” among other appli- 
cations. To make things worse, thousands of 
satellites are being launched by a few cor- 
porations into low Earth orbits. This will 
prevent all humankind from admiring a 
pristine sky anywhere on the planet. The 
wilderness experience one can have in a 
national park at night, for example, may be 
increasingly affected by the glows of cities 
and satellite megaconstellations. The loss 
of the starry night is an unprecedented loss 
for all cultures. 

Light pollution is also, and mainly, an 
environmental problem (9). Life on Earth 
evolved with sunlight during the day and 
starlight and the light from the Moon dur- 
ing the night. It is expected that introduc- 
ing artificial light into ecosystems at levels 
that surpass the amount of natural light ex- 
perienced in natural conditions will trigger 
changes in animal behavior. Some species, 


including predators, may take advantage of 
more light. Other species will be in danger of 
local or global extinction, which will reduce 
biodiversity (10). Moreover, animals, includ- 
ing humans, will diminish their production 
of melatonin in response to artificial light at 
night (77). Melatonin is a fundamental hor- 
mone that controls the internal circadian 
clock of animals, and thus animal physiol- 
ogy. A lack of this hormone could have nu- 
merous negative consequences on health 
(12). It is also expected that processes needed 
to produce, install, and operate outdoor arti- 
ficial lighting will release greenhouse gases 
into the atmosphere and contribute to global 
warming. An estimated 200 billion kg of car- 
bon dioxide is generated to produce the ~400 
terawatt-hours of electric energy needed for 
outdoor lights (73). 

Attempts to control light pollution have 
been carried out in the past decades, includ- 
ing aiming light below the horizon plane. 
This approach is not sufficient because any 
new light, even if shielded, will add pollu- 
tion to the night environment after being 
reflected off the surfaces being lit. New ap- 
proaches that treat light as a true pollutant 
could include introducing total caps and red 
lines on its production that are based on the 
control of light pollution indicators, such as 
the night sky zenith luminance or the illumi- 
nance on the ground coming from the entire 
sky (14, 15). This would be analogous to the 
reduction of most air contaminants that was 
achieved through legislative interventions, 
such as the Clean Air Act in the United States. 

Light pollution is an environmental prob- 
lem and as such should be confronted and 
solved. This may restore the need for an un- 
contaminated night environment by animals 
and humankind. 
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CATALYSIS 


Clicking zeolites together 


A new mechanism to synthesize catalytic zeolites gives a previously unknown topology 


By Russell E. Morris 


eolites are some of the most successful 

catalysts used in industry, particularly 

in petrochemicals processing. The to- 

pology of zeolites is important because 

this controls access to the catalytic 

sites inside these porous solids. There 
is a growing need for new catalytic materi- 
als that will accommodate larger substrates 
that are common in, for example, bio-based 
fuel refining. On page 283 of this issue, Li 
et al. (1) describe a mechanism by which an 
extra-large-pore zeolite is made by “clicking” 
together one-dimensional (1D) silicate chains 
that are present in a precursor solid. The syn- 
thesis hinges on the precise arrangement of 
the silicate chains in the precursor, which are 
aligned to connect to a highly crystalline and 
almost defect-free product. The new zeolite 
also features structural units that offer hith- 
erto unforeseen potential for distinct applica- 
tions such that the size of the zeolite pores 
matches the substrate better than what is 
possible now. 

Zeolites are porous, silicate-based solids 
whose catalytic function comes from the ad- 
dition of other elements into the framework 
structure. The nature of the catalytic func- 
tionality depends on the specific element 
that is added: For example, aluminum and 
titanium enable acid and redox catalysis, 
respectively. The zeolite topology controls 
access to this catalytic functionality, limit- 
ing which molecules can react according to 
those that can diffuse into or out of the pores. 
Zeolites are traditionally prepared using crys- 
tallization techniques under hydrothermal 
conditions (2). This is a dynamic process in 
which the successive making and breaking of 
chemical bonds leads to the growth of highly 
ordered crystal structures. 

Over the years, there have been many ad- 
vances based on modification of hydrother- 
mal crystallization that have led to improved 
control over zeolite topology. The develop- 
ment of organic structure-directing agents 
(OSDAs) has been especially fruitful (3), but 
other approaches to controlling zeolite topol- 
ogy, such as using solvents (4) and chemical 
compositions that have structure-directing 
properties (5), have expanded the range of 
possible zeolites that are available. An im- 
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portant goal of this work is the generation 
of so-called extra-large-pore materials that 
can handle larger substrates (6). As a result 
of extensive research, understanding of the 
features of hydrothermal crystallization that 
promote larger pores in the final material has 
improved. These features include the correla- 
tion between the density of the zeolite and 
the ring sizes that are present in the mate- 
rial—the smaller the average ring size, the 


From one dimension to three 
Precise organization of one-dimensional units, 
such as the chains present in ZEO-2, leads to 

the possibility of an irreversible condensation 
reaction to form ZEO-3, a zeolite with unusual and 
surprising three-dimensional topology. The success 
of this approach relies on the relative organization 
of the silicate chains in ZEO-2: They are almost 
perfectly arranged and held together by hydrogen 
bonding, which allows them to “click” together ina 
condensation reaction to form a covalently bonded 
extra-large-pore zeolite. 


@ Oxygen © Silicon 


--- Hydrogen bonds 


ZEO-2 


Heat 


“Click” 


H,0 


lower the density (7). In the study of Li et 
al., the synthesis of a zeolite they call ZEO-3 
seems to break this correlation and produces 
a material with unexpectedly high porosity 
for its large average ring size. 

The difference between the traditional ap- 
proach to zeolite synthesis and that for the 
preparation of ZEO-3 is the final, framework- 
forming step in the process (see the figure). 
Rather than a reversible crystallization, the 
last step is an irreversible condensation re- 
action of a precursor material called ZEO-2. 
ZEO-2 contains chains of silicate that are 
held together by hydrogen bonds that un- 
dergo a condensation reaction upon heating 
to produce ZEO-3, which contains chains 
that are then connected by stronger cova- 
lent bonding. The reaction is topotactic in 
that the structure of the chains themselves 
remains untouched by the process: Only the 
interchain bonding is affected. The conden- 
sation reaction is reminiscent in many ways 
of the click chemistry approach that has been 
so successful in organic chemistry and that 
won the 2022 Nobel Prize in Chemistry. The 
difference between the two processes (apart 
from the difference in the chemistry in- 
volved) is that for the formation of a crystal- 
line solid based on a fully connected lattice, 
the clicking together needs to be a concerted 
process over a long range of chain length. The 
key to this being possible is the relative ar- 
rangement of the chains in ZEO-2. The use of 
an unusual phosphorus-containing OSDA in 
the synthesis of ZEO-2 leads to the chains be- 
ing almost perfectly arranged such that they 
click seamlessly into place in the final mate- 
rial. Topotactic condensations of 2D (layered) 
precursors have been carried out previously 
(8) to provide new zeolites, but a 1D-to-3D 
transformation had not been demonstrated 
before. Applying a similar approach to the 
condensation of chains results in a surpris- 
ingly high degree of crystallinity and defect- 
free final material. 

What really makes this work intriguing, 
however, is the unexpectedly high porosity 
of the final material. It has been noted be- 
fore that changing the mechanism of zeolite 
synthesis, such as in the so-called assem- 
bly-disassembly-organization-reassembly 
(ADOR) method, can lead to unexpected 
materials (9, 10). This route has been dem- 
onstrated for small- and medium-pore zeo- 
lites, but the preparation of the extra-large- 
pore zeolite, ZEO-3, opens up the possibility 
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of targeting materials that can be used to 
catalyze reactions of large substrates. That 
the preparation of ZEO-3 is even possible 
implies that the previous “rules” for target- 
ing high-porosity zeolites are mechanism de- 
pendent and that finding new mechanisms 
will inevitably lead to new topologies. This is 
particularly attractive given the large num- 
ber of hypothetical zeolites that have been 
computationally predicted (77) but not yet 
realized in the laboratory. In addition, Li et 
al. show that elements such as titanium can 
be incorporated into ZEO-3, which is bene- 
ficial because such elements doped into the 
zeolites provide the active sites where ca- 
talysis takes place. In the case of titanium, 
this opens up the possibility of new redox 
catalysis. Developing such active zeolites is 
vital work because transferring their utility 
to new technologies, such as the formation 
of biodegradable plastics from renewable 
sources (12), is key to developing a society 
that is less reliant on petroleum products. 

The next question is whether the same 
type of chemistry can be done to form zeo- 
lites with different, even larger, pore struc- 
tures. The real key in the work of Li et al. 
is the almost perfect organization of the 
precursor, ZEO-2. Can the same level of 
organization be produced with other low- 
dimensional silicate materials (layers, chains, 
or even molecules)? In ZEO-2, the organiza- 
tion came about through a crystallization 
process, but an expansion of the potential 
of this mechanism would come from more 
active control of organization. For example, 
there have been advances in the high-yield 
preparations of exfoliated zeolitic nanolay- 
ers (13)—can such materials be organized 
so that they can be clicked together in the 
same way as ZEO-3? It is a major challenge 
to have such control over organization, but if 
it is possible, the resulting zeolites could also 
break the current “rules” and have structural 
properties different from those prepared us- 
ing traditional methods, in turn opening up 
new applications. 
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A strained ring for 
stereoselective synthesis 


The spring-loaded tension of a cyclic 
allene expedites synthesis of a natural product 


By Christian L. Jankovic and F. G. West 


ffective chemical synthesis of a com- 

plex natural product in a laboratory 

flask requires careful planning so that 

the smallest possible number of reac- 

tion steps is used and they all occur 

at high chemical yields. These efforts 
can be thwarted when key strategic steps 
(forming important chemical bonds or es- 
tablishing stereogenic centers) are unselec- 
tive—that is, they furnish undesired reac- 
tion products instead of or in addition to 
the one needed to complete the synthetic 
sequence. To overcome these challenges, 
new types of transformations are sought 
that proceed with high selectivity under the 
mildest possible conditions. On page 261 
of this issue, Ippoliti et al. (7) describe the 
use of a new class of reactive intermediates, 
cyclic allenes, in the enantioselective syn- 
thesis of the alkaloid lissodendoric acid A. 
This synthesis highlights how cyclic allene 
chemistry can work in tandem with other 
methods to obtain daunting products in a 
selective fashion. 

Allenes are three-carbon structures con- 
taining two alkene w bonds that typically 
exist with the carbon atoms arranged lin- 
early and with the groups attached to the 
terminal carbons in a 90° orientation to 
each other (see the figure). However, when 
this functional group is embedded in a six- 
membered ring, the linear geometry is no 
longer possible, causing a deformation that 
bends the ends toward each other and ro- 
tates the terminal groups closer to planarity. 
This distortion imparts considerable poten- 
tial energy—ring strain—into the molecule, 
and the release of this energy through a 
reaction that relieves the ring strain makes 
cyclic allenes highly reactive (2, 3). Indeed, 
six-membered cyclic allenes are so reactive 
that they exist only transiently and cannot 
be isolated (2). This high reactivity has been 
demonstrated by the observation of many 
types of trapping reactions that occur at or 
below room temperature (4-8). 
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Reactive strained intermediates have 
been used widely as building blocks for com- 
plex and functional molecules. Benzynes, 
aromatic rings that possess an endocyclic 
carbon-carbon triple bond, have long been 
the premier example of this concept, with 
many total syntheses using these fascinat- 
ing molecules in crucial bond-forming re- 
actions (9). Another class of strained mole- 
cules, cyclic alkynes, played an essential role 
in the development of biorthogonal chem- 
istry, which was awarded with the 2022 
Nobel Prize in Chemistry (10). The potential 
energy embedded in strained cyclic alkynes 
allows for the facile covalent linkage of bio- 
logically relevant moieties within living sys- 
tems. However, in contrast to these strained 
ring molecules, cyclic allenes—particularly 
those of smaller ring sizes—have thus far 
found limited application. Only now are 
cyclic allenes emerging as candidates for 
use in the construction of biologically and 
synthetically useful scaffolds. For example, 
the high reactivity of cyclic allenes was har- 
nessed in the preparation of DNA-encoded 
libraries (17), and the total synthesis of lis- 
sodendoric acid A reported by Ippoliti et 
al. provides another important illustration 
of their potential value. In particular, the 
synthesis of lissodendoric acid A exploits 
a critical difference between cyclic allenes 
and other strained intermediates: their in- 
trinsic chirality. 

Chirality usually results from the pres- 
ence of one or more atoms with four differ- 
ent groups attached. A distinct feature of 
cyclic allenes is their inherent axial chiral- 
ity, in which each enantiomer can furnish a 
different product. Although typically gener- 
ated as 50:50 mixtures of both enantiomers, 
methods exist to enable the selective gener- 
ation of one cyclic allene enantiomer (5, 72). 
A reaction that uses this single enantiomer 
would be expected to furnish a predomi- 
nant product with one three-dimensional 
arrangement, which is needed if the goal 
involves laboratory synthesis of a naturally 
occurring compound. Alkaloids such as lis- 
sodendoric acid A are usually formed in 
nature as a single enantiomer. For natural 
products with therapeutic properties, gen- 
erally a single enantiomer will be biologi- 
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Adding to the synthesis toolbox 


Allenes are three-carbon structures that typically exist in a linear arrangement (top left). However, when 
this functional group is embedded in a six-membered ring, the geometry becomes distorted, and this imparts 
ring strain into the molecule, making cyclic allenes highly reactive (top right). The enantioselective 
generation of a cyclic allene intermediate can be used to synthesize the central core of the natural product, 


lissodendoric acid A (bottom). 
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cally functional, whereas the others may be 
less effective or even harmful to living cells. 
Ippoliti et al. observed that the central core 
of lissodendoric acid A could be constructed 
with a Diels-Alder reaction that uses a cyclic 
allene, a reaction that would give access to 
the central fused ring scaffold. After care- 
ful validation of this hypothesis, the authors 
found a set of conditions that allowed for the 
high-yielding synthesis of a cyclic allene in- 
termediate, using a fluoride ion to cause an 
elimination reaction of a silicon-containing 
precursor. This reaction proceeded with a 
high degree of stereospecificity for the gen- 
eration of the required enantiomer. A re- 
markably concise synthetic route was then 
applied to obtain lissodendoric acid A. This 
sequence of steps included the extrusion of 
carbon dioxide at a key point to introduce 
a 1,3-diene present in lissodendoric acid A, 
along with a highly effective ring-closing 
metathesis reaction to form a challenging 
14-membered ring. The elegance of this ap- 
proach to a complex alkaloid natural prod- 
uct was made possible through the use of an 
enantiomerically enriched and highly reac- 
tive cyclic allene intermediate. Furthermore, 
this laboratory synthesis also confirmed the 
stereochemical assignment of the naturally 
occurring functional enantiomer, which had 
previously been surmised but unproven. 
Although the existence of cyclic allenes has 
been known for many decades, the study by 
Ippoliti et al. details the advantageous use of 
cyclic allenes in the chemical synthesis of a 
complex natural product. This achievement 


238 20 JANUARY 2023 * VOL 379 ISSUE 6629 


ee 


Bent (and highly strained) 


BocN 


Cyclic allene 


cs) ‘;@ 
cos = 133%" 
pO 


Key new strategic 
bonds formed 


demonstrates that cyclic allenes can func- 
tion as efficient building blocks en route 
to biologically intriguing molecules previ- 
ously found only in minute quantities from 
natural sources. Access to a robust and 
selective reaction expedites development 
of a streamlined synthetic sequence with 
greatly reduced costs of time and reagents. 
At the same time, valuable insights into 
the distinct reactivity of these unusual and 
transient molecules have been uncovered. 
This new process is a welcome addition to 
the synthetic chemist’s toolbox, and this 
work is likely to be one of many future ap- 
plications of cyclic allenes in complex mol- 
ecule synthesis. 
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Germline 
study points 
to sarcoma 
pathways 


Pathogenic variants related 
to mitosis and telomere 
integrity are enriched in 
sarcoma patients 


By Diana Mandelker' and Marc Ladanyi!” 


ermline cancer predisposition is an 

important cause of premature mor- 

tality. Traditionally, gene discovery 

in hereditary cancer involved either 

familial linkage analysis to segregate 

a variant in a family with several in- 
stances of the same cancer type, case-control 
studies for a limited number of genes or 
variants to establish relative risks, or ge- 
nomewide association studies (GWASs) that 
look for the association of common variants 
to cancer development. On page 253 of this 
issue, Ballinger et al. (1) identify genes and 
pathways associated with sarcoma predispo- 
sition, based on a case-control study design 
using whole-genome sequencing (WGS) of 
1644 individuals with sarcoma and 3205 
matched controls without sarcoma paired 
with genetic ontology analysis. They found 
pathogenic germline variants in genes re- 
lated to mitosis and telomere integrity in 44 
sarcoma patients, representing a significant 
enrichment relative to controls and, more 
broadly, highlighting the importance of these 
processes to sarcomagenesis. 

The known spectrum of genes associated 
with sarcoma predisposition is broad, but 
largely limited to rare genetic syndromes 
with other clinical features (2). Ballinger 
et al. identified 14 new candidate sarcoma- 
predisposition genes in seemingly sporadic 
sarcoma cases (see the table). These genes 
clustered functionally in the telomeric and 
mitotic pathways, specifically genes that 
were related to or encoded components of 
the shelterin complex (which consists of six 
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proteins involved in protecting telomere ends 
from being misrecognized as DNA damage 
by DNA repair mechanisms) and the centro- 
some complex (whose proper functioning in 
mitosis prevents chromosome mis-segrega- 
tion and aneuploidy). This suggests that al- 
terations in these biological processes are key 
to sarcoma predisposition. 

Although one of these telomere shelterin 
complex genes, protection of telomeres 1 
(POTD), had already been implicated in pre- 
disposition to angiosarcoma (3, 4), the study 
of Ballinger et al. extends its role in suscepti- 
bility to a wider spectrum of sarcoma types. 
Moreover, the identification of pathogenic 
variants in other shelterin complex genes re- 
inforces the idea that abnormally long telo- 
meres and increased telomere fragility pre- 
dispose to sarcomas. The pathogenic variants 
in shelterin complex genes and centrosome 
complex genes were observed in 44 patients 
with both diverse complex karyotype sarco- 
mas such as undifferentiated pleomorphic 
sarcoma (UPS) and angiosarcoma as well 
as, perhaps unexpectedly, simple karyotype, 
gene fusion-driven sarcomas such as syno- 
vial sarcoma, myxoid liposarcoma (MLPS), 
and alveolar rhabdomyosarcoma (ARMS), 
representing the two broad genomic 
classes of sarcomas (5, 6). 

The study by Ballinger et al. should 


these newly identified genes. 

Additional analyses of somatic alterations 
in sarcomas with these germline pathogenic 
variants may provide more insight into the 
biology of these cancers (7). Some of the 
somatic analyses presented by Ballinger et 
al. may argue against these germline vari- 
ants having a classic loss-of-function role in 
tumorigenesis. For example, for germline 
pathogenic variants in centrosome genes, 
only 2 of 24 of the tumors had evidence of 
loss of heterozygosity (LOH) of the wild-type 
allele. Traditionally, hereditary cancer sus- 
ceptibility genes require a second somatic 
hit (such as LOH) to fully inactivate the gene 
and drive tumorigenesis. Thus, ~80 to 90% of 
ovarian cancers arising in carriers of BRCAI 
or BRCA2 germline variants demonstrate 
LOH of the wild-type allele (8). Additionally, 
four of five gastrointestinal stromal tumors 
(GISTs) in carriers of centrosome gene vari- 
ants had somatic KIT mutations, whereas 
germline pathogenic variants in other GIST- 
associated syndromes usually have wild-type 
KIT (9, 10). Therefore, further work is needed 
to determine whether some of the variants 
identified may act in a haplo-insufficient or 
dominant-negative manner, where loss of the 


wild-type allele is not required for tumori- 
genesis. Most cancer predisposition genes are 
also sporadically inactivated by somatic mu- 
tations, as has been shown for POT] in angio- 
sarcomas (4), so it will be germane to screen 
the other genes nominated by Ballinger e¢ al. 
for somatic mutations in sporadic sarcomas. 

Over the past few years, genetic testing 
of broad cohorts of cancer patients has re- 
vealed that ~10 to 20% harbor pathogenic 
germline variants in cancer susceptibility 
genes (1/-13), despite the absence of clear 
family histories of the cancer types that 
would have triggered germline testing. 
Increasingly, this germline genetic testing 
is becoming integral to the care of cancer 
patients and their families because patients 
can receive targeted therapies [for example, 
germline BRCAI or BRCA2 mutations sensi- 
tize ovarian, breast, and pancreatic cancers 
to poly(ADP-ribose) polymerase (PARP) in- 
hibitors] and at-risk family members can 
undergo enhanced cancer surveillance or 
risk-reducing surgery. Notably, however, 
these genetic studies on large cancer co- 
horts have mostly focused on genes with 
well-established associations to cancer sus- 
ceptibility and therefore do not account for 
cancer hereditability that may result 
from genes not previously linked to 


prompt further work to more firmly 
establish these candidates as sarcoma 
predisposition genes and to deter- 
mine penetrance and the spectrum 
of cancer predisposition they confer. 
Although the study design did not al- 
low for estimates of penetrance, the 
pedigrees suggest a low penetrance for 
sarcoma but a possible shared predis- 
position to melanoma. Specifically, the 
authors find an overrepresentation of 
shelterin complex variants in families 
with melanoma and thyroid cancers, 
in addition to sarcoma. However, these 
family members were not available for 
familial studies to prove that the vari- 
ant segregates with disease to define a 
new cancer susceptibility syndrome. 
Furthermore, the study cohort was 
composed of a diverse mix of sarcoma 
types. Cancer predispositions can be 
specific to sarcoma type. For example, 
germline pathogenic variants in TP53 
(which encodes p53) in people with 
Li-Fraumeni syndrome are associated 
with predisposition to osteosarcomas 
and complex karyotype soft tissue 
sarcomas, but simple karyotype, gene 
fusion-driven sarcomas are not a part 
of Li-Fraumeni syndrome. Therefore, 
studies of larger cohorts of specific 
sarcoma types may refine the cancer 
susceptibility spectrum conferred by 
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Candidate sarcoma predisposition genes 
Pathogenic variants in 14 genes belonging to two broad functional 
classes were identified in 44 individuals with diverse sarcomas. 


Shelterin complex and related genes 

GENE LOCATION SARCOMA TYPES* 

POT1 7q31 WDLPS, UPS (2), angiosarcoma, 
myxofibrosarcoma, MPNST 

SMARCAL1 2435 Osteosarcoma (2), PNET,UPS(2),GIST 


Osteosarcoma, rhabdomyosarcoma NOS, UPS 
UPS, chond 


TIMELESS 12q13 


Chondrosarcoma (2), synovial sarcoma 


TINF2 14q12 Liposarcoma NOS, myxoid liposarcoma, UPS 


Centrosome complex and related genes 


CEP72 5p15 UPS, synovial sarcoma (2), GIST (2), PNET, ARMS 
a . fe a ae eae 
ae rons : ra eure : Se ay Padi lateeewaceeaeensads 
ae . or a i cae foes evenness 
SSNA1 9934 Leiomyosarcoma 


*Each sarcoma type represents one individual unless otherwise specified. 

**One patient with GIST harbored truncating variants in both HAUS4 and HAUS5. 

ARMS, alveolar rhabdomyosarcoma; CEP, centrosomal protein; GIST, gastrointestinal stromal 
tumor; HAUS, HAUS augmin like complex subunit; MPNST, malignant peripheral nerve 
sheath tumor; NOS, not otherwise specified; PCM1, pericentriolar material 1; PNET, primitive 
neuroectodermal tumor; POT], protection of telomeres 1; SMARCALI, SWI/SNF-related 
matrix-associated actin-dependent regulator of chromatin subfamily Alike protein 1; 
SSNAI, SS nuclear autoantigen 1; STAG3, stromal antigen 3; TERF1, telomeric repeat binding 
factor 1; TERF2/P, TERF2 interacting protein; TINF2, TERF1 interacting nuclear factor 2; UPS, 
undifferentiated pleomorphic sarcoma; WDLPS, well-/dedifferentiated liposarcoma. 
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cancer susceptibility, such as those 
identified by Ballinger et al.—hence 
the importance of the unbiased WGS 
approach in this study. By using a 
nuanced experimental approach in- 
volving WGS, an extreme phenotype 
design, and genetic ontology analysis, 
the authors have discovered 14 can- 
didate genes associated with sarcoma 
predisposition. If confirmed, this will 
be critical to the management of in- 
dividuals and families with sarcoma 
because it will enable more at-risk ge- 
netic testing and cancer surveillance. 
Furthermore, this experimental de- 
sign has the potential to reveal more 
genes that confer susceptibility to a 
wide variety of diseases and enhance 
the ability to practice individualized 
genetic medicine in the future. 
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Remaking the Chinese Academy 


of Sciences 


Under pressure to reinvent itself, the CAS should concentrate 
on managing large-scale research infrastructures 


By Xiyi Yang’, Xiaoyu Zhou’, Cong Cao? 


he decades-long reform of China’s 

science and technology system has 

repeatedly ignited a critical question 

about the position of the Chinese 

Academy of Sciences (CAS). The acad- 

emy’s early contributions to China’s 
strategic weapons programs, and indeed, to 
China’s nation-building efforts, have left a 
legacy that partly mirrors the development 
of Chinese science but is almost impossible 
to replicate. But the CAS has been under 
enormous pressure to reinvent itself by pro- 
ducing original and more visible outcomes 
to justify its existence as the most promi- 
nent research institution in China. We dis- 
cuss major challenges faced by the CAS and 
the antecedents of its current reform, a re- 
shuffling of its 100-plus research institutes 
under a new Pioneer Initiative (J). By con- 
centrating resources on big-science centers 
managing large-scale research infrastruc- 
tures (RIs), CAS may be able to enhance 
its uniqueness and strategic importance 
among China’s many research institutions 
and to potentially bolster its future. 


ONGOING S&T SYSTEM REFORM 
After some 30 years’ intensive investments, 
China has achieved remarkable progress 
in science and technology (S&T), at least in 
quantitative terms (2). But scratching be- 
neath that surface reveals a creeping suspi- 
cion that the country’s innovation system is 
still underperforming (3, 4). The long-stand- 
ing criticism of China’s lack of originality, 
along with the intensified global technologi- 
cal competition, has further prompted the 
country to reform its S&T system to become 
a major S&T power in the world by 2050. 
The Chinese government’s most recent na- 
tional 5-year plan, announced in 2020 with 
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objectives for 2035, sets three major targets 
for S&T reform. First, China will concentrate 
resources and especially accelerate invest- 
ments to strengthen its strategic S&T force. 
National laboratories, an important fixture 
of the strategic S&T force, would be formed 
to tackle key technological problems faced by 
the country. 

Second and related, China will continu- 
ously invest in the construction of big-science 
Rls. Globally, big-science RIs have been the 
foundation for some of the most success- 
ful national laboratories, including the Oak 
Ridge National Laboratory in the US, the 
European Organization for Nuclear Research 
(CERN) in Switzerland, and the High Energy 
Accelerator Research Organization (KEK) in 
Japan. By using big-science RIs, such labo- 
ratories undertake not only basic research 
but also research tasks with strategic goals. 
Furthermore, the construction and operation 
of these laboratories and the associated big- 
science RIs are often coordinated by govern- 
mental institutions or transnational research 
organizations that go beyond the capability 
of a single university or research institution. 

Third, China’s ongoing efforts of forming 
a market-oriented and enterprise-centered 
innovation system require a more delicate 
division of labor in its different actors. Mar- 
ket-oriented research and innovation should 
gradually be transferred to enterprises. Uni- 
versities should turn out the next generation 
of talents who are able to handle increasingly 
complex tasks. And national laboratories 
should focus their efforts on cutting-edge re- 
search that fits into national agendas, or the 
research in Pasteur’s quadrant (5, 6). 

In this context, as China’s largest and pre- 
mier research institution, the CAS needs to 
refine its role. Headquartered in Beijing and 
with an employment of 69,000 and a post- 
graduate student body of 79,000, the acad- 
emy currently comprises 11 branches and 
more than 100 institutes throughout the 
country. During his visit to the CAS in July 
2013, China’s President Xi Jinping, while af- 
firming the academy’s past contributions, 
also challenged it to become a pioneer in four 


The Large High Altitude Air 
Shower Observatory (LHAASO), 
shown in June 2022, was built 
in Daocheng County, Sichuan 
Province, to detect cosmic rays. 


aspects: achieving leap-forward development 
in S&T, nurturing a national innovative tal- 
ent pool, enhancing its role as the nation’s 
premier S&T think tank, and building world- 
class research institutes. Apparently, the po- 
litical leadership is still unsatisfied with the 
academy’s most recent performance, espe- 
cially given its consumption of a substantial 
amount of state funding (in 2021, it operated 
on an S&T expenditure of USD$13.26 billion 
in 2022 dollars, including USD$5.77 billion, 
or 43.51%, from government appropriation, 
and its R&D expenditure reached USD$12.52 
billion, considerably more than any other 
Chinese institutions of learning). 


MAJOR CHALLENGES FACING THE CAS 
Early on, the division of labor between uni- 
versities focused on education, and industrial 
R&D institutes focused on problems of their 
own sectors, justified the existence of the CAS 
as a national academy. From the 1950s to the 
1970s, the CAS demonstrated its value, espe- 
cially by its contributions to the success of 
China’s strategic weapons programs, through 
which it also leveraged to achieve substantial 
disciplinary development. 
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In the mid-1980s when China started to 
reorganize its S&T system, the CAS adopted 
a “one academy, two systems” strategy that 
required the majority of its staff to engage 
in commercialization and projects of direct 
relevance to the economy while concentrat- 
ing a small number of scientists on basic 
research and on following the global trend 
in high and new technology. Consequently, 
the academy’s overall research quality and 
ability to tackle fundamental research ques- 
tions were undermined. 

The launch of the Knowledge Innovation 
Program (KIP) in 1998 was to directly ad- 
dress these problems and the academy’s un- 
certain future. The CAS initially retrenched 
its staff, aiming to make its institutes nim- 
ble and its workforce mobile to better re- 
spond to new challenges (7). However, the 
downsizing of the academy, along with the 
strengthening of research capability in uni- 
versities and the national defense sector, 
had threatened its existence. In response, 
in the late stage of the KIP, the CAS not 
only reversed its earlier approach but also 
swung to the other extreme by substantially 
expanding its reach. It established research 
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institutes in emerging scientific disciplines 
and in cities where it did not have a pres- 
ence. It also formed alliances with provin- 
cial and local governments and industrial 
organizations to help raise technological 
content of the local economy, thus deviating 
from its mission as a national academy. 
Entering the 21st century, the CAS was 
further challenged for its vitality in China’s 
S&T system. The research areas of many 
CAS institutes overlapped with each other 
and with those of universities, and a num- 
ber of projects repeated one another un- 
necessarily. The quality of research was not 
satisfactory (8). The academy also faced an 
increasing challenge from top universities 
in recruiting talents. For instance, among 
the 667 “Young Thousand Talents” recruited 
in 2015, 94 went to CAS-affiliated organiza- 
tions while 43 and 42 went to Tsinghua and 
Peking University, respectively. 
Furthermore, there was a lack of interest 
among CAS scientists in seeking opportuni- 
ties to apply their research, for two primary 
reasons. First, the Chinese government has 
been using its wealth of funds and politi- 
cal will to push innovation from the top. 


A substantial proportion of research proj- 
ects, including those done at the CAS, were 
state led instead of market oriented, which 
limited the opportunity of commercializa- 
tion (4). Second, performance evaluation in 
most CAS institutes determined that scien- 
tists had little incentive for commercializa- 
tion. Following the global trend to conduct 
research and being the first to publish re- 
mained the primary goal of CAS scientists. 

In addition, the academy’s multiple func- 
tions and missions—from research, talent 
training, strategic high-tech development, 
commercialization and local engagement 
to the provision of policy advice as a think 
tank and through its honorific members— 
had made both the overall organizational 
management and the evaluation of individ- 
ual institutes and scientists across the CAS 
extremely difficult. Indeed, the crux is more 
than that of funding or politics, but rather 
is related to its institutional setting with its 
unclearly defined tasks overwhelming and 
unsettling the CAS and its scientists (9). 

In August 2014, just 1 year after Presi- 
dent Xi’s visit, the CAS proposed the Pio- 
neer Initiative to reform itself in another 
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unprecedented way. This time, it proposed 
to reorganize all its institutes into four cat- 
egories: centers of excellence focused on 
basic research, innovation institutes do- 
ing applied research, big-science centers 
managing large-scale RIs, and specialized 
institutes that address problems specific to 
a region of China (J). Those institutes that 
did not fit into the restructuring would be 
merged, moved to the three universities 
under its jurisdiction, or dissolved. The em- 
phasis on basic research, commercialization, 
and regional development is like putting old 
wine in a new bottle as they were the cen- 
tral themes of earlier reforms like the KIP, 
and doing so would not differentiate the CAS 
much from China’s many other R&D organi- 
zations and universities. By contrast, centers 
of big-science RIs may bring real opportuni- 
ties for the CAS to prove its uniqueness and 
strategic importance in China’s S&T system. 


BIG-SCIENCE INFRASTRUCTURE 

“Big science” is a style of scientific research 
characterized by the usage of large-scale 
Rls by groups of scientists to address grand 
challenges of human society. Big-science RIs 
have become indispensable for scientific 
breakthroughs in physics, astronomy, mate- 
rials science, biomedical science, and many 
other areas. For instance, with synchrotron 
radiation sources, imaging facilities, relativ- 
istic heavy-ion colliders, and free-electron 
lasers, the Brookhaven National Laboratory 
has spawned at least seven Nobel Prizes 
since 1947. Similarly, high-energy physics 
and cosmology discoveries have brought nu- 
merous Nobel Prizes to CERN, where some 
of the world’s largest and most complex RIs 
are located. International collaboration has 
also been strengthened as the construction 
and operations of big-science RIs involve 
substantial resources and knowledge trans- 
fer from other countries. 

Initially, the US, European Union, and Ja- 
pan were key players in hosting big-science 
Rls. Major facilities include Stanford Linear 
Accelerator Center (SLAC) in the US, Large 
Hadron Collider (LHC) in CERN, and the 
SuperKEKB in KEK of Japan for research 
on particle physics, as well as major syn- 
chrotron light sources such as Advanced 
Photon Source (APS) in the US, Super Pho- 
ton Ring-8 GeV (SPring-8) in Japan, and 
European Synchrotron Radiation Facility 
(ESRF) in France. Later on, some develop- 
ing economies also started to invest in RIs. 
For instance, the Synchrotron-light for Ex- 
perimental Science and Applications in the 
Middle East (SESAME) and Sirius, the new 
synchrotron light source in Brazil. 

As a latecomer in scientific research, China 
began to lay out its own big-science RIs in 
the 1980s (0). In the 21st century, mainly 
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through the Medium and Long-Term Plan for 
the Development of Science and Technology 
(2006-2020), the country further initiated a 
series of mega-programs that involved gen- 
erous government investment in big-science 
Rls. Consequently, China is now home to a 
few of the world’s largest RIs, such as the 
Five-hundred-meter Aperture Spherical Tele- 
scope and the Shanghai High Repetition-rate 
X-ray Free Electron Laser and Extreme Light 
Facility (SHINE) (see the table for a list of 
major big-science RIs in China). 


REMAKING THE CAS 

According to its self-assessment, by 2020, 
the CAS had accomplished the initial step 
of the Pioneer Initiative by restructuring 
most of its research units. From 2020 to 
2030, it will continue reforming its person- 
nel system and mobilizing resources across 
the four categories of institutes to become 
the country’s main sources of innovative 
ideas, talents, and scientific achievements, 


as challenged by President Xi. Compared 
to the other three categories, concentrating 
resources on the big-science centers may 
bring more distinctive competitiveness to 
the CAS to meet challenges from top univer- 
sities and other research institutes as well 
as industrial R&D endeavors. 

This conclusion is derived from analysis of 
two critical attributes of the four categories 
of institutes: internal resourcefulness and 
external uniqueness. Resourcefulness refers 
to how many resources the CAS has already 
owned and can access, and it determines 
the performance potential of the reformed 
institutes. Uniqueness, taking an external 
perspective, considers how the reformed in- 
stitutes can distinguish or stand out from 
other actors of the S&T system, and defines 
CAS’s strategic role in China’s S&T landscape. 

The centers of excellence face fierce 
competition as the research capability and 
achievements of many universities and 
ministry-sponsored research institutes have 


Major big-science research infrastructure in China 


CONSTRUCTION 
YEAR OF COST (IN RESPONSIBLE 

RESEARCH INFRASTRUCTURE NAME CONSTRUCTION MILLIONUSD) ORGANIZATION LOCATION 
Beijing Electron Positron Collider (BEPC) 1984 165.14 CAS Beijing 
Hefei Light Source 1984 55.32 CAS Hefei 
Shenguang-II High Power Laser 1994 N/A CAS Shanghai 
Heavy lon Research Facility (HIRFL) 2000 69.67 CAS Lanzhou 
Experimental Advanced Superconducting 2000 4747 CAS Hefei 
Tokamak (EAST) 
Large Sky Area Multi-Object Fiber 2001 55.38 CAS Beijing 
Spectroscopic Telescope (LAMOST) 
Shanghai Synchrotron Radiation Facility (SSRF) 2004 271.34 CAS Shanghai 
China Spallation Neutron Source (CSNC) 2007 250.23 CAS Dongguan 
Daya Bay Reactor Neutrino Experiment 2007 N/A CAS Shenzhen 
Steady High Magnetic Field Facility (SHMFF) 2008 49.21 CAS and University Hefei 

of Science 

and Technology 
Five-hundred-meters Aperture Spherical 2009 N/A CAS Guizhou 
Telescope (FAST) 
National Center for Protein Science 2010 45.30 CAS Shanghai 
Soft X-ray Free Electron Laser 2011 36.42 CAS and Peking — Shanghai 

University 
Dalian Coherent Light Source (DCLS) 2014 747 CAS Dalian 
Large High Altitude Air Shower Observatory 2016 N/A CAS Daocheng 
(LHAASO) 
Earth System Numerical Simulation Facility 2018 98.43 CAS and Tsinghua Beijing 
(EarthLab) University 
Shanghai High Repetition-rate X-ray Free Electron 2018 581.11 ShanghaiTech Shanghai 
Laser and Extreme Light Facility (SHINE) University 
High Energy Photon Source (HEPS) 2019 N/A CAS Beijing 
The original costs were first converted to RMB as of 2022 to account for inflation and then converted from RMB to USD based 
on the currency market exchange rate in December 2022. Sources: https://Issf.cas.cn/ CAS, Chinese Academy of Sciences; 


N/A, no publicly available information of construction cost. 
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risen rapidly, with many research areas and 
programs often overlapping with those of the 
CAS. Attracting top talents and applying for 
basic research funding have already been dif- 
ficult for the CAS. It is also uncertain whether 
the new centers of excellence can produce 
more scientific breakthroughs than universi- 
ties or other research organizations without 
exhausting too many resources. The suc- 
cess of the innovation institutes depends on 
whether the CAS can resolve the discrepancy 
between the needs of its scientists and the in- 
dustrial sector or whether it can spin off new 
ventures as it did before. Although scientists 
care more about being the first in their find- 
ings, integrating science with cost effective- 
ness is more vital to the commercialization of 
research and high-tech entrepreneurship. It 
is therefore questionable whether institutes 
in this category can outperform industrial 
R&D institutes where staff have more incen- 
tive and market knowledge for commercial- 
ization. Moreover, the specialized institutes 
focusing on regional issues may also be too 
fragmented for the CAS to generate a national 
impact. Overall, these three types of reformed 
institutes may not warrant their affiliation 
with the national academy as measured by 
their resourcefulness and uniqueness. 

The big-science centers, however, satisfy 
both conditions. The CAS has built and 
administered about 80% of all big-science 
Rls currently operated in China (17), reflect- 
ing its distinctive feature and unmatched 
advantage. It has cumulated experience 
in building and managing these facilities, 
which are fully government financed. For 
instance, the construction of the High En- 
ergy Photon Source, the world’s brightest 
fourth-generation synchrotron facility, is 
supported by special funding from the cen- 
tral government and Beijing municipal gov- 
ernment; funding for constructing SHINE, 
China’s most expensive RI, has come from 
the National Development and Reform 
Commission and Shanghai municipal gov- 
ernment (12). Despite not having a substan- 
tial presence within the CAS, in 2019, 65% 
of the funding from the nine special basic 
science projects secured by the CAS from 
the National Key R&D Program, a program 
administered by the Ministry of Science and 
Technology, went to the big-science centers. 
Pivoting the CAS to these big-science infra- 
structure projects also provides flexibility 
for the academy to establish and extend its 
competitive advantage. By nature, big sci- 
ence’s most ambitious projects—satellites 
and space probes, particle accelerators, and 
gravitational-wave observatory—rival those 
of universities and industrial organizations 
in their size and complexity. 

Furthermore, many existing big-science 
Rls have already generated world-class sci- 
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entific breakthroughs. Examples include 
the discovery of a four-quark matter from 
the Beijing Electron Positron Collider (13) 
and a new form of neutrino oscillations 
from the Daya Bay Reactor Neutrino Ex- 
periment (J4), with the latter involving a 
multinational team with researchers from 
China, as well as Chile, the US, Russia, and 
the Czech Republic. The RIs have also facili- 
tated technological innovations that meet 
the strategic demands of China’s industrial 
and economic development. For instance, 
the development of the BeiDou Navigation 
Satellite System has benefited from the 
BPL/BPM Time Service System, a research 
facility that provides reliable high-precision 
time service to industrial developments in 
China since the 1970s. Some preliminary 
work suggests that the construction of the 
Shanghai Synchrotron Radiation Facility 
(SSRF) enhanced China’s internal innova- 
tion capability (75). Investing in big-science 
centers may thus have the potential to meet 
China’s imperative for scientific excellence 
and technological competitiveness. 

Prioritizing these big-science centers 
also helps settle the recent discussion on 
the establishment of national laborato- 
ries. Rather than diverting and potentially 
wasting China’s increasing but still scarce 
resources to emergent but untested or- 
ganizations, turning part of the CAS into 
national laboratories may be most cost- 
effective and feasible. By doing so, the 
CAS may evolve into a national manage- 
ment agency responsible for the construc- 
tion and operation of all big-science RIs 
in China. The strategic human capital and 
tacit knowledge that the CAS has accumu- 
lated will help it to become the leading big- 
science training center in China, providing 
professional training for research scien- 
tists and architectural engineers in disci- 
plines related to large-scale RIs. Hosting 
big-science RIs at the CAS and turning it 
into national laboratories will reshape the 
academy’s position in China’s S&T system, 
avoiding disorderly competition with uni- 
versities and industrial research institutes. 
Doing so also fulfills a major task of the 
reform of China’s S&T system. 

The CAS may continue to learn from 
its international peers regarding how to 
build and operate national laboratories 
around big-science RIs. Indeed, since 2017, 
the Chinese government has already been 
experimenting with a few national labo- 
ratories in Shanghai, Beijing, Hefei, and 
Guangzhou, with extensive involvement 
of the CAS. However, China has seen a 
recent rush to host new RIs by local gov- 
ernments, but not all of them have a good 
grasp of the necessity and advanced nature 
of the facilities. Given the huge costs, the 


construction of big-science RIs must an- 
ticipate the scientific needs of the future 
instead of merely considering the existing 
demands of a mature body of users. Hence, 
there is a need for coordination of efforts 
as well as for cultivation of a growing user 
community for the RIs. 

The process of turning the CAS into na- 
tional laboratories oriented toward admin- 
istering and supervising China’s big-science 
RIs is complicated and even radical and 
painful, especially given its storied history. It 
will be less a scientific and more a political, 
economic, and administrative decision. The 
CAS leadership may start considering shift- 
ing resources toward the big-science centers, 
within its “Pioneer Initiative” framework. Of 
course, we do not mean to suggest an im- 
mediate separation of other categories of 
institutes from the academy. However, by ori- 
enting resources toward big-science-related 
institutes, the CAS may be able to gradually 
downsize other categories of institutes and 
reduce its redundant or ineffective functions. 
The questions become whether the govern- 
ment would support and facilitate such a 
transformation after carefully weighing the 
costs and benefits and what the CAS should 
do to streamline the process, especially re- 
garding the other parts of the institution. 
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Making modern science 


Victorian values permeate contemporary scientific 


culture, maintains a historian 


By Bernard Lightman 


n How the Victorians Took Us to the 
Moon, Iwan Rhys Morus has drawn on 
his vast knowledge of the history of the 
physical sciences in 19th-century Britain 
to depict the emergence of “new ways of 
thinking about and organising science 
that were directed at the future in a wholly 
new and unprecedented way, and 
some of the key consequences of 8 
that reorientation.” Understand- 
ing this, Morus argues compel- 
lingly, helps us understand our 
current situation, for we mod- 
erns think about and organize 
science in the same way. 

Although Morus develops a 
complex argument throughout 
the book, his elegant and access- 
ible writing style will appeal to 
a variety of audiences, including 
historians of science, scientists, and casual 
readers. His ability to synthesize recent 
scholarship to present a novel, coherent 
story is truly impressive. 

Each of the book’s chapters introduces 
the reader to a different group of scientists 
and engineers who were central to 19th- 
century discoveries in the physical sciences, 
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How the Victorians 
Took Us to the Moon 
Iwan Rhys Morus 
Pegasus, 2022. 400 pp. 


including the transatlantic cable, the rail- 
ways, the telegraph and telephone, calculat- 
ing machines, and flying machines. In each 
case, Morus shows that development of the 
new science and technology led Victorians 
to believe that they were transforming 
the world and that, as a result, the future 
would be very different from the past. In 
other words, scientists and engineers were 
not just inventing new scientific 
theories and technologies, they 
were also “invent[ing] the future 
as we know it.” To illuminate this 
point, Morus frequently refer- 
ences contemporaneous works 
of science fiction, a literary genre 
born in this period. 

Morus’s explanation for how 
the Victorians remade the future 
is tied to how science itself was 
reinvented in the early 19th cen- 
tury. The story therefore begins 
with a new breed of experts, led by John 
Herschel, William Whewell, and Charles 
Babbage, who came on the scene in the 
1820s. These individuals attempted (un- 
successfully, at first) to reform the old Royal 
Society and later, in 1831, created the British 
Association for the Advancement of Science. 
Through it, they shaped the era’s science. 

Victorians considered self-discipline to 
be a masculine virtue, and their approach 
to science—then considered an exclusively 
male pursuit—reflected this. Even a man 
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Workers dig the Thames Tunnel in this engraving 
from the Illustrated London News. 


who came from a humble working-class 
background, like the natural philosopher 
Michael Faraday, could become an exem- 
plary man of science, they believed, be- 
cause men knew how to discipline them- 
selves (unlike women, who were ever “at 
the mercy of their uncontrollable bodies”). 

Morus also draws a connection between 
this image of the disciplined scientist and 
the importance of showmanship to the 
business of invention. He pays consider- 
able attention to scientific and industrial 
exhibitions such as the Adelaide Gallery, 
the Great Exhibition of 1851, and later 
exhibitions and museums in which the 
inventor was as much on show as their 
invention. 

Men of invention, such as Isambard 
Kingdom Brunel, John Henry Pepper, and 
Thomas Edison, were held up as powerful, 
charismatic, and heroic figures during this 
era. As Morus points out, however, this im- 
age of the scientific loner was an illusion. 
Scientific discovery and invention were 
often the product of the efforts of a multi- 
tude of expert workers. 

Another strand of Morus’s argument 
brings in the imperial dimensions of 
19th-century science. “All the technological 
innovations—actual and imagined—de- 
scribed in this book had imperial entangle- 
ments,” he declares. By this, he means that 
they were dependent on the resources of 
an imperial power and that they them- 
selves provided ways of maintaining and 
expanding the reach of empire. 

Morus has given the history of 19th- 
century science something that it has des- 
perately needed: a new big picture that 
integrates the insights of recent scholar- 
ship. Of course, there are gaps. His em- 
phasis on the physical sciences excludes, 
for example, more engagement with recent 
scholarship on the history of the life sci- 
ences, including the theory of evolution. 
And despite the discussion of imperialism, 
Morus spends most of his time analyzing 
British scientists and engineers. But one 
book can only do so much. 

Morus ends with a sobering thought: 
Unreflectingly following the Victorian re- 
cipe for future-making sets limitations on 
how we might remake our own futures. 
There is no escape from the fact that the 
science that governs our lives is the product 
of an imperial, masculine culture. If, Morus 
insists, we want “to change science and its 
culture for the better, we need to start by 
remembering this history.” & 
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ANTHROPOLOGY 


Pirates and politics 


An anthropologist argues that experimental communities 
in Madagascar influenced the European Enlightenment 


By Jatin Dua 


n 1695, as it made its way through the 
Red Sea, the Ganj-i-Sawai, a trading 
vessel belonging to the Mughal em- 
peror Aurangzeb, was attacked and 
ransacked by a ragtag group led by 

Englishman Henry Avery, the notori- 
ous “king of Pirates.” Although European 
pirates had been crucial to the making of 
the British Empire (and other European 
empires), the British sought to distance 
themselves from Avery and his comrades, 
launching a worldwide manhunt against 
him, the first such global operation in re- 
corded history. Avery, however, 
eluded capture and was rumored 
to have sailed to Madagascar, 
where he found a home amid the 
burgeoning pirate communities 
on the northeast coast. 

Avery’s high jinks, along with 
other daring stories of piracy on 
the high seas, captured the pub- 
lic imagination in England and 
throughout Europe. Through fic- 
tional and fictionalized accounts 
and biographies, a proliferation 
of pirate stories—including many 
that featured a legendary pirate 
colony in Madagascar known as 
Libertalia—circulated throughout 
Europe, emphasizing the fantasy 
and romance of piracy, a lore that lives on 
in contemporary renderings. 

In his final posthumously published 
book, Pirate Enlightenment, or the Real 
Libertalia, anthropologist and _ political 
activist David Graeber revisits this histori- 
cal moment, alongside his earliest ethno- 
graphic fieldwork. This elegantly breezy 
treatise takes readers on a journey to the 
monsoonal waters of the Indian Ocean and 
the verdant landscape of Madagascar. 

Piracy, as Graeber compellingly notes, 
is not just myth and romance but also an 
actual political experiment in forms of so- 
cial organization and living that are nonhi- 
erarchical and antiauthoritarian. Divided 
into three parts, Pirate Enlightenment 
draws from a number of contemporane- 
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ous accounts of pirate exploits, colonial ar- 
chives, historiography on Madagascar, and 
Graeber’s own fieldwork in the region to 
create a vivid narrative that seeks to move 
beyond Eurocentric understandings of the 
expansion of piracy from the Atlantic to the 
Indian Ocean in the 17th and 18th centuries. 

European merchants and colonists were 
part of a long history of encounters that 
shaped Malagasy social life. However, un- 
like other travelers who formed complex 
political alliances or incorporated them- 
selves within local society through mar- 
riage, European settlers frequently re- 
mained aloof or used violence as a mode 


Saint Mary’s Island, off the coast of Madagascar, was once a pirate haven. 


of establishing presence in Madagascar, re- 
veals Graeber. The pirates who arrived on 
the northeast coast were far more amena- 
ble to incorporating themselves into local 
systems, he maintains, and were open to 
creating new forms of political, economic, 
and social organization. While acknowl- 
edging the “meager” and often “sensation- 
alistic” nature of sources, Graeber cites 
examples drawn from contemporaneous 
popular accounts where newly settled pi- 
rates played important roles as advisers 
and mediators in establishing new settle- 
ments and polities infused with an egali- 
tarian ethos. Specifically, Graeber notes 
that pirates successfully “convert[ed] the 
democratic institutions first developed on 
board ships into forms that would be vi- 
able on land.” 

In the second part of the book, Graeber 
switches vantage points and examines the 
arrival of pirates from the lens of Malagasy 
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society. Here, he includes a fascinating dis- 
cussion on the agency and importance of 
Malagasy women traders in creating alter- 
native loci of political, economic, and so- 
cial authority. 

Graeber concludes by expanding on 
the provocation that Madagascar’s pi- 
rate colonies can be understood as an 
Enlightenment political experiment that 
resonated far beyond the region and shaped 
emerging notions of liberty, freedom, and 
egalitarianism in Europe at the turn of the 
18th century. Absent a direct line of con- 
nection between these experimental com- 
munities and figures such as Montesquieu, 
Locke, and Hobbes, Graeber bases 
his thesis on both the physical 
presence of numerous “pirate en- 
voys” at European courts and the 
proliferation of pirate tales across 
Europe, arguing that pirates were 
a major topic of conversation and 
“the Enlightenment was an intel- 
lectual movement uniquely tied 
to conversational forms.” 

Pirate Enlightenment plural- 
izes and globalizes our under- 
standing of whose ideas and 
actions are considered impact- 
ful and whose vision shapes the 
world, a framing that still reso- 
nates in contemporary times. 
Many of today’s self-proclaimed 
pirates, for example, would claim that 
they—as coastal villagers, fishermen, and 
traders—are legitimate protectors of the 
sea. Piracy was and is many things, in- 
cluding a demand to be taken seriously as 
political actors. Yet global hierarchies con- 
tinue to divide up worlds into the West and 
the rest. 

In his academic writing and political 
commitments, David Graeber exemplified 
an ethos of action and conversation. There 
is a certain bittersweetness to this text, one 
that ends with an exhortation toward the 
arts of speaking and conversation. Graeber 
himself is no longer around to speak, to de- 
bate, or to inspire protest and action. 

As anthropologists have noted, gifts 
are inalienable—they contain within them 
something of the giver. Graeber’s final book 
is certainly such a gift. 
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Avian flu threatens 
Neotropical birds 


A highly pathogenic avian influenza virus 
(HPAIv) that spread through the Holarctic 
region in 2022, affecting millions of birds 
(1), has reached South America. The 
disease threatens marine and terrestrial 
birds, including endangered species. Real- 
time information gathering and steps 

to prevent the spread of disease will be 
required to mitigate this outbreak. 

Along the Peruvian coastline alone, 
HPAIv killed more than 22,000 wild birds 
in just 4 weeks in 2022, mainly Peruvian 
pelicans (Pelecanus thagus) and Peruvian 
boobies (Sula variegata) (2), both of 
which are categorized as Endangered in 
Peru (3). The extant Peruvian population 
of pelicans in coastal protected areas is 
around 124,000 (4). The virus has also 
killed Near Threatened Guanay cormo- 
rants (Leucocarbo bougainvilliorum) (2, 
3). Beyond their conservation status and 
intrinsic value, these species are of eco- 
nomic interest. The guano produced by 
the Peruvian pelican, the Peruvian booby, 
and the Guanay cormorant is used as 
fertilizer and constitutes one of the most 
valuable economic resources in Peru (5). 

HPAIv is spreading across the conti- 
nent. By December 2022, the virus had 
been found in birds in Ecuador, Colombia, 
Venezuela, and Chile (6). In addition to 
marine birds, the virus could threaten 
scavengers, as it has in other regions of 
the world (7), including the emblematic 
Andean condor (Vultur gryphus) (8). Viral 
transmission could occur if those scav- 
engers consume infected carcasses while 
traveling from the Andean region to the 
coastline (9). 

We urge Peruvian and other South 
American authorities and conservation 
managers to take steps to track and miti- 
gate the spread of this disease. Up-to-date 
information about the wild bird species 
affected and the number of birds remain- 
ing in the region will be necessary to 
understand the extent of HPAIv’s effects. 
Managers should perform active and pas- 
sive epidemiological surveillance and 
reduce the availability of infected carcasses 
in the environment to prevent transmis- 
sion to marine birds, obligate scavenger 
species, and humans. Given the diffi- 
culty of controlling emerging pathogens, 
reducing anthropogenic threats such as 
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environmental pollution, resource overex- 
ploitation, and habitat loss should also be 
an urgent priority. Minimizing other risks 
will give bird populations a better chance 
against emerging pathogens. Finally, we 
recommend continuing these efforts even 
if this flu event passes; the epidemiological 
behavior of this virus indicates that recur- 
rent outbreaks are likely (J0). 
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Atmospheric goals for 
sustainable development 


Atmospheric chemistry and composition 
underlie the existential threats of climate 
change and ozone depletion (7), and 
poor air quality represents the greatest 
environmental health issue in the mod- 
ern world (2). However, the only targets 
and indicators related to atmospheric 
health and clean air in the United Nations 
Sustainable Development Goals (SDGs) 
focus on reducing fine particulate pollu- 
tion levels and the associated mortality 
rates (3, 4). By creating targets for a broad 
range of specific pollutants, the SDGs 
could facilitate more effective actions, 
monitoring, and funding. 

The SDGs consider air quality only 
in terms of the mass density of particu- 
late matter of 2.5 4m or less (PM2.5). 
However, PM2.5 is not a single pollutant 
but rather a measurement that contains 
many different compounds, including 
sulfate, black carbon, metals, and organic 
compounds of all particle sizes below 2.5 
ym. Measuring PM2.5 does not provide 
specific information about the concentra- 
tions of the components that make up 
PM2.5, nor does it account for the impact 
of other air pollutants known to be harm- 
ful, such as ozone and nitrogen dioxide. 
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Given that we still cannot fully explain 
the cause of the statistical link between 
PM2.5 and adverse health outcomes (5, 6), 
this measurement alone does not provide 
adequate information. The SDGs would be 
more valuable if they also included targets 
to reduce deaths from specific pollutants. 
Particle number concentration and chemi- 
cal composition may be more important 
than fine particulate matter mass density 
for health outcomes (7, 8). Moreover, 
instead of tracking excess mortality alone, 
targets should include nonfatal health 
impacts, as well as impacts on ecosystems 
and agriculture. 

Metrics for reduction of other atmo- 
spheric pollutants could be added 
to future developments of the SDGs. 

For example, in SDG 10 (Reduced 
Inequalities), reducing pollutants could 
contribute to access to clean air to 
breathe, a major contributor to health 
inequalities (9). Targets that address 
stratospheric ozone chemistry could be 
added to SDG 15 (Life on Land), given that 
life is only possible because of a healthy 
atmosphere, including the ozone layer 
(10). Inclusion of targets for tropospheric 
ozone would increase the pressure to 
control atmospheric pollution holistically, 
which—if ozone levels were successfully 
reduced—would also increase agricultural 
yields (SDG 2, Zero Hunger) (11). SDG 13 
(Climate Action) could improve transpar- 
ency by providing explicit targets for the 
peak concentrations of all major climate 
forcers, including greenhouse gases and 
particulates, which would help to track 
our progress in meeting global climate 
change commitments. 

Updating the SDGs to incorporate 
atmospheric targets would improve data, 
policies, and incentives. Better regional 
policies on air pollution control and 
monitoring would also be supported in 
accordance with the broader guidance. In 
addition, a more holistic strategy would 
provide incentives for funders to support 
scientists researching pollutants that are 
not categorized as PM2.5. 
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Evidence required for 
ethical social science 


The history of science is littered with vio- 
lations of the human rights of research 
participants (J, 2). One response has 
been regulation. In the United States, the 
Belmont Report lays out ethical principles 
for human subjects research (3), which 
form the foundation of modern-day insti- 
tutional review boards. According to the 
Belmont Report, ethics committees and 
investigators need to determine whether 
the benefits of the research outweigh the 
costs and whether the costs unevenly 
burden particular groups. However, the 
assumptions underlying such assessments 
can be flawed (4, 5). Without evidence 
about the causal effects of social science 
research on participant welfare, research- 
ers risk causing unexpected harms or 
missing unexpected benefits. 

Randomized experiments can provide 
data to replace assumptions about the 
potential benefits and costs of research on 
study participants. By randomly assigning 
some people to participate in a study and 
others not to participate, researchers can 
estimate the causal effects of participa- 
tion on welfare (6). Researchers can also 
randomly assign and compare multiple 


versions of a protocol to learn about the 
marginal costs and benefits to participants, 
or they could randomize the sequence of a 
study component and participant welfare 
questions (7). Finally, researchers can add 
participant welfare questions to the end of 
an existing experimental study to under- 
stand the costs and benefits of a treatment. 
In each case, the experiments should be 
designed to assess whether costs and bene- 
fits are distributed equally among different 
subsets of participants. 

Research ethics has come a long way 
since the Belmont Report, but research- 
ers who work with human subjects still 
lack the causal evidence to make fully 
informed ethical choices. Researchers 
know they can do better, as calls for eth- 
ics appendices or ethics sections in pre- 
registrations illustrate (8, 9). To facilitate 
the adoption of these strategies, funders 
could require more transparency on the 
ethics of the work they support. The 
experimental designs we propose can be 
embedded into ongoing research efforts, 
adding little expense. Researchers can also 
coordinate efforts by using open access 
forums to post study instruments, data, 
and results, and when appropriate can 
implement harmonized experiments to 
build generalizable evidence across cases 
(10). Randomized experiments that assess 
the effects of participating in research 
can augment existing practices and build 
credible evidence to inform future risk- 
benefit calculations and improve ethical 
decision making. 
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PLANT SCIENCE 
Parsing nodulation 
pathways 


Legumes benefit from sym- 
biotic microbes resident in 
root nodules that fix nitrogen, 
whereas most other plants 
depend on externally sup- 
plied or gathered nitrogen. 
Rubsam et al. identified a 
bifunctional receptor complex 
in the model legume Lotus 
japonicus that initiates devel- 
opment and infection of the 
root nodules. Development 
of root nodules could be 
driven solely by the intracel- 
lular domains containing the 
kinases once the receptor 
complex was formed, but 

the Nod factor—recognizing 
ectodomains were required 
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be like, and different strategies can affect f 
overall fitness. Petrullo et al. studied differ- 

ent strategies in a population of red squirrels 

that has been monitored for decades. They 

found that mother squirrels that bet on a good 
environment, and produced more offspring, 

had higher overall fitness than mothers who bet 

that the environment would be poor. This was 

the case even when the optimistic mothers were 
wrong. —SNV Science, abn0665, this issue p. 269 


Over a lifetime, American red squirrel (Tamiasciurus 
dsonicus) mothers successfully raise more young when 
they assume that their environment will be favorable. 


to support rhizobial infec- 
tion. Receptors identified in 
the cereal barley, which does 
not form symbiotic nitrogen- 
fixing root nodules, were able 
to support Lotus root nodule 
organogenesis, suggesting that 
the path to nitrogen fixation in 
cereals might be shorter than 
we thought. —PJH 

Science, ade9204, this issue p. 272 


SOLAR CELLS 
Two additives for 
perovskite modules 


The performance of large-area 
perovskite solar cells and mod- 
ules fabricated by the vacuum 
flash solution-processing 
method has been enhanced 
with two different additives. You 
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n nature, environmental fluctuation is com- 
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et al. synthesized a fullerene 
derivative with a phospho- 
nate group that accelerated 
electron transfer across grain 
boundaries in the perovskite 
film and suppressed ion 
migration. They also prepared 
a polypropylene oxide polymer 
substituted with a redox 
active species that could 
facilitate p-doping of the hole 
transport layer quickly, unlike 
the conventional air-doping 
approach. Large-area modules 
(17 square centimeters) had 
power-conversion efficiencies 
of up to 21.4%. The modules 
retained 95% of their effi- 
ciency after more than 3000 
hours of continuous illumina- 
tion at elevated temperature. 
—PDS 

Science, add8786, this issue p. 288 


QUANTUM SIMULATION 
Ahybrid platform for 
quantum simulation 


Quantum simulators are typi- 
cally constructed from a set of 
quantum particles that are con- 
trollably placed on a lattice and 
then allowed to interact with 
each other, but there are limita- 
tions. Simulators based on 
neutral atoms lack the flexibility 
to independently control and 
read out single atoms, trapped- 
ion based quantum systems 
are difficult to scale beyond 
tens of ions, and superconduct- 
ing quantum circuits are limited 
to local interactions between 
qubits. Zhang et al. con- 
structed a many-body quantum 
simulator by interfacing 
superconducting qubits with 
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a microwave photonic band- 
gap metamaterial waveguide. 
This hybrid superconducting 
qubit-metamaterial approach 
represents a route toward devel- 
oping a large-scale quantum 
simulator platform, extending 
the lattice to two dimensions 
and hosting a larger number of 
quantum particles. —ISO 
Science, ade7651, this issue p.278 


OPTICS 
For more information, 
make some noise 


Holograms can be considered 
large-capacity memory stor- 
age media that are capable 
of holding information about 
three-dimensional scenes 
and compressing it into a 
two-dimensional pattern (meta- 
surface). Attempting to store 
more images onto the one holo- 
gram pattern tends to result in 
cross talk and corruption of the 
stored data. Xiong et al. found 
that introducing engineered 
noise into the process enabled 
an increase in storage capacity. 
This approach should work for 
metasurfaces in applications 
such as high-capacity optical 
displays, information encryp- 
tion, and data storage. —ISO 
Science, ade5140, this issue p. 294 


STELLAR ASTROPHYSICS 
Asimulated dynamo 
in radiative stars 


Strong stellar magnetic fields 
are generated by dynamos, 
which typically require con- 
vection of material inside the 
star. Petitdemange et al. used 
numerical simulations to show 
how a stellar dynamo can form 
in layers that are purely radia- 
tive (meaning that they have 
no convection). Their results 
matched most of the proper- 
ties of a theoretical prediction 
known as the Tayler-Spruit 
dynamo. The resulting magnetic 
field remains trapped inside the 
star, so it would not be observ- 
able on the surface but could 
potentially be inferred using 
asteroseismology. Because 
magnetic fields can transport 
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angular momentum, the 
mechanism slows the spin of 
stellar cores and increases the 
rotation of the outer layers, 
enhancing the mixing of chemi- 
cal elements. —KTS 

Science, abk2169, this issue p. 300 


CANCER 
SCD-ing into home 
for GBMs 


Glioblastomas (GBMs) are 
highly enriched with GBM 
stem-like cells (GSCs) that 
cause resistance to standard 
therapy. These cells rely on 
deregulated de novo lipid 
synthesis, which represents a 
potential target for treatment; 
however, known inhibitors are 
not brain penetrant and suscep- 
tibility is unknown. Eyme et al. 
found that the stearoyl CoA 
desaturase inhibitor YTX-7739 
was able to trigger lipotoxicity 
in patient-derived GSCs and 
showed therapeutic efficacy in 
mouse models. Mechanistically 
aberrant MEK/ERK signaling 
endowed sensitivity to this 
enzyme inhibitor, whereas 
AMPK activation rendered the 
GSCs resistant. —-DLH 

Sci. Trans/. Med. 15, eabq6288 (2023). 


PHYSIOLOGY 
Ahepatokine 
elevates lipolysis 


Increased circulating triglycer- 
ide levels are a risk factor for 
the development of cardiovas- 
cular and fatty liver diseases. 
Kim et al. identified a hepa- 
tokine called CREBH-C that 
stimulates the lipolysis and 
clearance of circulating triglyc- 
erides. CREBH-C is secreted 
from the liver in response to 
increased energy demands 
(such as those induced by 
fasting) or hepatic stress (Such 
as that induced by obesity or 
a high-fat diet) in mice and 
humans. CREBH-C treatment of 
mice fed a high-fat diet reduced 
circulating triglyceride amounts 
and increased triglyceride 
uptake by nonhepatic tissues. 
—WW 

Sci. Signal. 16, eadd6702 (2023). 
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NEUROSCIENCE 


Edited by Caroline Ash 
and Jesse Smith 


Dynamic cortical Down states 


hen asleep, under anesthesia, or in some damaged 
states, the brain exhibits slow oscillations. These 
alternate between active periods with neuronal firing 
called Up states and quiescent periods associated 
with unconsciousness called Down states. The 
mechanisms underlying alternations between Up and Down 
states are still not fully understood. Caramassa et al. analyzed 
Up and Down state transitions and found that they contain 
two complementary periods that determine relative respon- 
siveness: A highly synchronized period after the Up-to-Down 
transition with a time course that is defined by the exponen- 
tial decay of the phase and another stochastic period. The 
work offers some insight into how the brain slips in and out of 
unconsciousness. —PRS J. Neurosci. 42, 9387 (2022). 


SIGNAL TRANSDUCTION 
Designer anti- 
inflammatory receptors 


The modular nature of pro- 

tein interactions that mediate 
cellular signal transduction 
offers the opportunity to design 
therapeutics. In the human body, 
hundreds of billions of cells are 
estimated to die each day, and 
they are mostly disposed of by 
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a process called efferocytosis. If 
efferocytosis fails, then damag- 
ing inflammation can ensue. 
Morioka et al. constructed recep- 
tors that combined a sensor 
domain recognizing phosphati- 
dylserine on the surface of dead 
cells with an adapter domain 
from a cytoplasmic protein 
called ELMO that helps to propa- 
gate signals for efferocytosis. 
Expression of such chimeras in 
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BIOSENSORS 


Spying on the microbiome 


ut microbiota play important roles in animal digestion and immune response, and many 
microbes are capable of metabolic transformations not performed by the host. Vaaben 
et al. generated and characterized eight biosensor plasmids that, when transformed into 
a suitable bacterial host, could report on the presence of oxygen, by-products of metabolism, 
and host-derived molecules in vitro and in a model worm. This collection of sensors could 

in theory be adapted to report on multiple stimuli at once to build a more complete picture 

of the gut environment in model animals. -MAF ACS Synth. Biol. 11, 4184 (2022). 


Bioengineered biosensors can detect changes in lactate concentrations in the gut lumen of individual 
Caenorhabditis elegans nematodes (shown in this false-color image). 


phagocytic cells enhanced effe- 
rocytosis and was protective in 
mouse models of inflammatory 
diseases. —LBR 

Cell 185, 4887 (2022). 


ABORTION 
Risk of suicide 


Abortion access remains a 
divisive issue in the United 
States, and its media cover- 
age can affect women’s mental 
health. Although abortion 
legislation has been linked to 
women's anxiety and depres- 
sion rates, suicide risk has not 
been addressed. Zandberg 

et al. examined enforcement 

of state-level restrictions on 
reproductive care, including 
abortions, with state-level 
suicide rates between 1974 and 
2016. The authors used the 
Targeted Regulation of Abortion 
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Providers (TRAP) laws index, 
a measure that increases 
when a TRAP law is enforced 
and decreases when the law 
is blocked. Their analyses 
compared the impact of TRAP 
laws among reproductive-aged 
women (20 to 34 years old) 
with a control group of post- 
reproductive-age women (45 
to 64 years old). Enforcement 
of TRAP laws increased the 
annual suicide rate by 5.81% 
among the younger, reproduc- 
tive-aged women. The findings 
have implications for suicide 
prevention interventions for 
younger women in light of the 
recent reversal of legislation 
that made access to an abor- 
tion a federal right in the United 
States. —EEU 
JAMA Psychiatry 
10.1001/jamapsychiatry. 
2022.4394 (2022). 
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COSMOLOGY 
Supernovae support a 
cosmological constant 


Type la supernovae (SN las) can 
be used as standard candles, 
astronomical objects with 
intrinsic brightness that can be 
determined from observations, 
allowing their distance to be 
inferred independently of their 
redshift. Analysis of SN las can 
therefore be used to constrain 
cosmological models, which 
predict the redshift at a given 
distance. Brout et al. performed 
a joint analysis of more than 
1500 SN las, spanning redshifts 
from 0.001 to 2.3, and mea- 
sured cosmological parameters 
with greater precision than pre- 
vious supernova studies. Their 
results agree with standard 
cosmology and constrain the 
dark energy equation of state, 


finding that it is consistent with 

a cosmological constant. —KTS 
Astrophys. J. 10.3847/ 
1538-4357/ac8e04 (2022). 


METABOLISM 
Anew player in obesity 


Drugs that target serotonin have 
broad uses in treating neuro- 
psychiatric disorders, but there 
can be limiting side effects 
such as increased hunger and 
weight gain. The appetite-sup- 
pressing actions of serotonin 
are mediated through the 
5-hydroxytryptamine receptor 
2C (SHT.,R) expressed by hypo- 
thalamic proopiomelanocortin 
neurons. He et al. performed 
whole-exome sequencing in 
2548 people with severe obesity 
and found rare loss-of-function 
variants in the HTR2C gene, 
which encodes 5HT,.R. Mice 
expressing one of the HTR2C 
variants exhibited increased 
appetite and developed obesity 
when fed a high-fat diet despite 
increased physical activity. 
Thus, HTR2C variants likely 
contribute to some cases of 
severe childhood-onset obesity, 
and 5HT,.R signaling may offer 
a therapeutic target. -GKA 

Nat. Med. 28, 2537 (2022). 


FULLERENE CHEMISTRY 
Decorating molecular 
soccer balls 


The extraordinary finale of last 
year's World Cup treated fans 
to the marvelous ball-handling 
skills of Messi and Mbappé. The 
molecular realm also features 
a soccer ball, the icosahedral 
C,, fullerene, and chemists too 
strive to manipulate it skillfully. 
Lu et al. report chiral pyramidal 
capsules that can trap fullerene 
inside and adhere substituents 
to it selectively in just one of 
two mirror-image orientations. 
Specifically, the C,, reacts 
with anthracene groups on the 
capsule faces through Diels-Alder 
cycloaddition, after which the 
capsule can be disassembled by 
imine exchange to liberate the 
decorated ball. —JSY 

Nat. Chem. 10.1038/s41557-022- 

01103-y (2021). 
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LYMPHOMA 
How lymphomas 
outcompete 


Lymphomas are cancers of the 
immune system that arise from 
B cells undergoing a strict natu- 
ral selection process required for 
immunity. These highly mutating 
and dividing B cells vigorously 
compete against each other for 
T cell help to survive. Mutations 
affecting B cell translocation 
gene 1 (BTG1) are exclusive to B 
cell lymphoma and associated 
with poor clinical outcomes. 
Mlynarczyk et al. found that 
mutant BTG1 effects were 
limited to conferring B cells with 
only subtle acceleration of their 
T cell help response. This effect 
occurred at the checkpoint that 
governs natural selection of 
B cells, so these cells became 
“supercompetitors” that out- 
paced and replaced their normal 
counterparts. This behavior 
mirrors embryonic-specific 
supercompetition processes, 
pointing to BTG1 as an evolu- 
tionary “gatekeeper” of natural 
selection during the adaptive 
immune response. —SMH and 
PNK 

Science, abj7412, this issue p. 252 


CRISPR 


Adecade of CRISPR 

In the decade since the pub- 
lication of CRISPR-Cas9 as a 
genome-editing technology, the 
CRISPR toolbox and its applica- 
tions have profoundly changed 
basic and applied biological 
research. Wang and Doudna now 
review the origins and utility of 
CRISPR-based genome editing, 
the successes and current limi- 
tations of the technology, and 
where innovation and engineer- 
ing are needed. The authors 
describe important advances 

in the development of CRISPR 
genome-editing technology and 
make predictions about where 
the field is headed. They also 
highlight specific examples in 
medicine and agriculture that 
show how CRISPR is already 
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affecting society, with exciting 
opportunities for the future. —DJ 
Science, add8643, this issue p. 251 


CANCER GENOMICS 
Sarcoma genes in 
the limelight 


Sarcomas are cancers of 
muscles, bones, and other 
connective tissues that tend 
to develop in younger patients 
and are often aggressive and 
difficult to treat. Because of 
their relative rarity, the biology 
of sarcomas is not nearly as 
well understood as that of more 
common cancers. In a mas- 
sive, multinational genomic 
study involving thousands of 
patients, their families, and 
controls, Ballinger et al. identi- 
fied distinct biological pathways 
where mutations increase the 
inherited risk for developing 
sarcoma through alterations of 
telomere biology and mitotic 
function (see the Perspective 
by Mandelker and Ladanyi). 
More work is needed before 
these findings can lead to 
therapeutic advances, but the 
study provides much needed 
biological insight into a deadly 
disease. —YN 

Science, abj4784, this issue p. 253; 

see also adf8572, p.238 


ORGANIC CHEMISTRY 
Synthesis by strain 


Chemists often rely on strained 
intermediates to drive reactions 
that relieve the strain. In this 
context, however, cyclic allenes 
that cram adjacent double bonds 
into tight carbon rings have been 
underexploited. lppoliti et al. now 
report a synthetic route to lisso- 
dendoric acid A, a marine natural 
product that relies on transient 
generation of acyclic allene to 
prepare the fused ring core (see 
the Perspective by Jankovic 

and West). They accessed a 
single mirror-image form of the 
allene from a chiral precursor 

by fluoride attack on a silicon 
substituent and displacement 

of an adjacent bromide, setting 


up a stereospecific Diels-Alder 
cycloaddition with a pyrone 
reagent. —JSY 
Science, adeO032, this issue p. 261; 
see also ade7122, p. 237 


ZEOLITES 
Condensing chains into 
extra-large pores 


Zeolites with extra-large pores 
could absorb and process larger 
molecules, but strategies for 
synthesizing these materi- 
als are limited. Li et al. show 
that a silicate precursor with 
one-dimensional chains can 
undergo condensation reac- 
tions upon heating to remove 
the organic template to form 
Si-O-Si bridges that maintain 
the crystal topology (see the 
Perspective by Morris). The 
resulting low-density, pure silica 
zeolite has 14- and 16-mem- 
bered silicate rings and highly 
thermal and hydrothermal 
stability. —PDS 

Science, ade1771, this issue p. 283; 

see also adf3961, p.236 


IMMUNOTHERAPY 
Activating the STING 


in tumors 


Agents that activate the DNA- 
sensing CGAS-STING pathway 
have been explored for can- 
cer immunotherapy in both 
preclinical and clinical settings, 
but how to optimally activate 
the pathway remains unclear. 
Jneid et al. found that incorpora- 
tion of the naturally occurring 
STING ligand cyclic guanosine 
monophosphate-adenosine 
monophosphate (CGAMP) into 
noninfectious enveloped virus- 
like particles (VLPs) selectively 
activated STING in antigen- 
presenting cells, including 
dendritic cells, when adminis- 
tered intratumorally. Compared 
with a nontargeted synthetic 
STING agonist, VLPs delivering 
cGAMP systemically enhanced 
tumor-specific T cell responses 
and antitumor effects during 
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immune checkpoint blockade in 
mice. These results demonstrate 
that selectively targeting STING 
agonists to antigen-presenting 
cells may improve their thera- 
peutic effects, particularly in 
poorly immunogenic tumors. 
—CO 

Sci. Immunol. 8, eabn6612 (2023). 


LIGHT POLLUTION 
The night sky is rapidly 
getting brighter 
Artificial lighting that escapes 
into the sky causes it to glow, 
preventing humans and animals 
from seeing the stars. Satellites 
can measure the light emitted 
upward, but they are not sensi- 
tive to all wavelengths produced 
by LED lighting or to light 
emitted horizontally. Kyba et al. 
used data from citizen scientists 
to measure how light pollution 
is affecting human views of 
the stars worldwide (see the 
Perspective by Falchi and Bara). 
Participants were shown maps 
of the sky at different levels of 
light pollution and asked which 
most closely matched their 
view. Trends in the data showed 
that the average night sky got 
brighter by 9.6% per year from 
2011 to 2022, which is equivalent 
to doubling the sky brightness 
every 8 years. —KTS 

Science, abq7781, this issue p. 265; 

see also adf4952, p.234 
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CRISPR 


CRISPR technology: A decade of genome 
editing is only the beginning 


Joy Y. Wang and Jennifer A. Doudna* 


BACKGROUND: The fields of molecular biology, 
genetics, and genomics are at a critical 
juncture—a moment in history when a con- 
vergence of knowledge and methods has made 
it both technically possible and incredibly 
useful to edit specific base pairs or segments 
of DNA in cells and living organisms. The ad- 
vent of clustered regularly interspaced short 
palindromic repeat (CRISPR) genome editing, 
coupled with advances in computing and im- 
aging capabilities, has initiated a new era in 
which we can not only diagnose human dis- 
eases and even predict individual susceptibil- 
ity based on personal genetics but also act on 
that information. Likewise, we can both iden- 
tify and rapidly alter genes responsible for 
plant traits, transforming the pace of agricul- 
tural research and plant breeding. The appli- 
cations of this technology convergence are 
profound and far reaching—and they are hap- 
pening now. In the decade since the publica- 
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tion of CRISPR-Cas9 as a genome editing 
technology, the CRISPR toolbox and its appli- 
cations have profoundly changed biological 
research, impacting not only patients with 
genetic diseases but also agricultural practices 
and products. As a specific example from the 
field of genomic medicine, it has become fea- 
sible to obtain a complete sequence of the 
human genome in less than 24 hours—a stag- 
gering advance considering the first such 
sequence took 5 years to generate. Nota- 
bly, designing and putting to use a potent 
CRISPR genome editor to obtain clinically 
actionable information from that genome— 
previously a near-intractable challenge—now 
takes only a matter of days. For additional 
background and related topics, we refer read- 
ers to in-depth reviews of the microbiology 
and structural biology of CRISPR systems and 
to articles about the considerable ethical and 
societal challenges of this technology. 


Next 10 years 


CRISPR-based treatments 
in later stages of clinical trials 
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CRISPR: past, present, and future. The past decade of CRISPR technology has focused on building the 
platforms for generating gene knockouts, creating knockout mice and other animal models, genetic screening, and 
multiplexed editing. CRISPR's applications in medicine and agriculture are already beginning and will serve as the 
focus for the next decade as society's demands drive further innovation in CRISPR technology. 
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ADVANCES: The past decade has witnessed the 
discovery, engineering, and deployment of RNA- 
programmed genome editors across many ap- 
plications. By leveraging CRISPR-Cas9’s most 
fundamental activity to create a targeted ge- 
netic disruption in a gene or gene regulatory 
element, scientists have built successful plat- 
forms for the rapid creation of knockout mice 
and other animal models, genetic screening, 
and multiplexed editing. Beyond traditional 
CRISPR-Cas9-induced knockouts, base editing— 
a technology utilizing engineered Cas9's fused 
to enzymes that alter the chemical nature of 
DNA bases—has also provided a highly useful 
strategy to generate site-specific and precise 
point mutations. Over the past decade, scien- 
tists have utilized CRISPR technology as a 
readily adaptable tool to probe biological func- 
tion, dissect genetic interactions, and inform 
strategies to combat human diseases and engi- 
neer crops. This Review covers the origins and 
successes of CRISPR-based genome editing and 
discusses the most pressing challenges, which 
include improving editing accuracy and pre- 
cision, implementing strategies for precise 
programmable genetic sequence insertions, 
improving targeted delivery of CRISPR edi- 
tors, and increasing access and affordability. 
We examine current efforts addressing these 
challenges, including emerging gene insertion 
technologies and new delivery modalities, and 
describe where further innovation and engi- 
neering are needed. CRISPR genome editors 
are already being deployed in medicine and 
agriculture, and this Review highlights key 
examples, including a CRISPR-based therapy 
treating sickle cell disease, a more nutritious 
CRISPR-edited tomato, and a high-yield, disease- 
resistant CRISPR-edited wheat, to illustrate 
CRISPR’s current and potential future impacts 
in society. 


OUTLOOK: In the decade ahead, genome edit- 
ing research and applications will continue to 
expand and will intersect with advances in 
technologies, such as machine learning, live 
cell imaging, and sequencing. A combination 
of discovery and engineering will diversify and 
refine the CRISPR toolbox to combat current 
challenges and enable more wide-ranging ap- 
plications in both fundamental and applied 
research. Just as during the advent of CRISPR 
genome editing, a combination of scientific 
curiosity and the desire to benefit society 
will drive the next decade of innovation in 
CRISPR technology. 
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CRISPR 


CRISPR technology: A decade of genome 
editing is only the beginning 


Joy Y. Wang"? and Jennifer A. Doudnah?345-6.78« 


The advent of clustered regularly interspaced short palindromic repeat (CRISPR) genome editing, 
coupled with advances in computing and imaging capabilities, has initiated a new era in which genetic 
diseases and individual disease susceptibilities are both predictable and actionable. Likewise, genes 
responsible for plant traits can be identified and altered quickly, transforming the pace of agricultural 
research and plant breeding. In this Review, we discuss the current state of CRISPR-mediated genetic 
manipulation in human cells, animals, and plants along with relevant successes and challenges 

and present a roadmap for the future of this technology. 


tarting with a 1987 report about repeti- 

tive DNA sequences in a bacterial genome 

(1), a small set of researchers working 

in the fields of microbiology and food 

science began studying mysterious DNA 
sequence arrays known as clustered regularly 
interspaced short palindromic repeats (CRISPRs), 
commonly found in microbial genomes together 
with genes encoding CRISPR-associated (Cas) 
proteins. The presence of short DNA sequences 
within CRISPRs matching those in viruses 
hinted at the function of these systems as adap- 
tive immunity pathways used to prevent viral 
infection (Fig. 1A) (2). Curiosity-driven research 
ultimately showed how CRISPR systems use 
RNA molecules transcribed from the sequence 
arrays to guide Cas proteins to cut, and there- 
by destroy, viral DNA or RNA (3, 4). Further- 
more, this line of research showed how CRISPR’s 
RNA-programmed cutting action (5, 6) could 
be used to alter DNA sequences in any cell with 
unprecedented ease [reviewed in (7)]. Over the 
past decade, scientists around the world have 
rapidly adapted CRISPR to enable both funda- 
mental research and wide-ranging applications 
in animals, plants, and humans. 

The most widely used genome editor is the 
CRISPR-Cas9 protein complexed with its part- 
ner RNA (Fig. 1B). The power of CRISPR asa 
genome editing technology stems from its 
chemical mechanism of DNA cutting at a site 
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dictated by RNA-determined sequence recog- 
nition. Because Cas proteins use RNA-DNA 
base pairings for DNA recognition, the same 
protein, such as Cas9, can target a wide range of 
DNA sequences by simply swapping guide RNAs 
(Fig. 1B). In eukaryotic cells, DNA breaks are 
efficiently repaired, enabling targeted changes 
to DNA sequences at will (Fig. 1B). Mutating 
the amino acids required for cleavage activity 
in the Cas9 active sites allows targeted DNA 
nicking (introducing a single-stranded DNA cut) 
or DNA binding by a catalytically inactive Cas9. 
As a result, the first examples of engineered 
CRISPR-Cas involved transcriptional repres- 
sion or activation to silence or up-regulate spe- 
cific genes (8, 9). Other forms of engineered Cas9 
are fused to enzymes that enable individual 
nucleobase editing, chromatin modification, 
or sequence insertion (JO-13). Other Cas pro- 
teins, including RNA-targeting proteins, have 
been explored as genome-modifying tools, en- 
abled by discovery efforts and extensive biochem- 
ical and structural characterization [reviewed 
in (/4-17)]. Some of these enzymes have also 
been harnessed for the development of im- 
aging methods (78-20) and diagnostic ap- 
proaches (21, 22). 

Together, applications of CRISPR technol- 
ogy have provided the foundation for clinical 
trials of therapies to treat sickle cell disease, 
beta-thalassemia, the degenerative disease 
transthyretin (TTR) amyloidosis, and congen- 
ital eye disease, as well as planned clinical 
trials for both rare (progeria, severe combined 
immunodeficiency, familial hypercholesterol- 
emia) and common (cancer, HIV infection) 
diseases. CRISPR technology has enabled agri- 
cultural advances including slick-coat cattle 
and a more nutritious tomato. It has spurred 
research across fields of molecular and cell 
biology, fueling the publication of thousands 
of research articles and providing a tool base 
for many companies focused on therapeutics, 
agriculture, and synthetic biology. However, 
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CRISPR technology and its potential impact 
are still in their early stages. As we discuss in 
the next section, some genome editing appli- 
cations have now become routine whereas 
others remain difficult due to limitations of 
today’s tools. These genome editing challenges 
provide opportunities for new discoveries and 
engineering to advance the field by offering a 
more complete toolbox for genetic manipulation. 


CRISPR-induced gene knockouts 


The past decade has witnessed the astounding 
success of CRISPR-induced gene knockouts, 
which have transformed basic and transla- 
tional research and demonstrate tremendous 
potential in agriculture and therapeutic devel- 
opment. Traditional CRISPR-induced knock- 
out methods in eukaryotic cells involve the 
CRISPR-Cas9 ribonucleoprotein (RNP), com- 
posed of the Cas9 nuclease and an engineered 
single-guide RNA molecule (sgRNA) (23-26). 
The sgRNA directs Cas9 to the target site, where 
it creates a double-stranded DNA break (DSB) 
that is repaired by endogenous repair path- 
ways including the nonhomologous end join- 
ing (NHEJ) and the microhomology-mediated 
end joining pathways and the more precise 
homology-directed repair (HDR) pathway that 
uses a repair template [reviewed in (27-29)] 
(Fig. 2A). Because of the high targeting spec- 
ificity and efficacy of CRISPR-Cas9, such gene 
knockouts are now routine in research appli- 
cations, providing a streamlined process to 
disrupt genes for functional study. 
CRISPR-Cas9 has proven to be successful in 
enabling the rapid creation of knockout (KO) 
mice and other animal models (30, 37) (Fig. 2B). 
Traditional gene targeting methods used in- 
efficient homologous recombination in em- 
bryonic stem (ES) cells, followed by laborious 
screening of modified ES cells for the desired 
sequence change and injection into WT em- 
bryos (32, 33). CRISPR-Cas9 provides a way to 
introduce DSBs in a one-cell-stage embryo, by- 
passing the screening stage of suitable targeted 
ES cells and greatly simplifying the production 
of gene-edited animals (34, 35). This has re- 
duced the time needed for generating geneti- 
cally modified mice, from 1 year to as few as 
4 weeks (36). As a result, the production of KO 
and transgenic mice has now become routine 
for research applications. Additionally, because 
most mammalian species lack established ES 
cell lines, CRISPR-Cas9 editing has facilitated 
the development of genetically engineered ani- 
mal models in new species (37, 38). The gener- 
ation of KO and transgenic animal models has 
become even more efficient with advancements 
in strategies for introducing CRISPR-Cas9 com- 
ponents into zygotes, including CRISPR RNP 
electroporation of zygotes (CRISPR-EZ) (39), 
CRISPR RNP electroporation and AAV donor 
infection (CRISPR-READI) (40), and improved 
genome editing through oviductal delivery of 
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nucleic acids (i-GONAD) (41). Beyond germline 
editing, CRISPR-Cas9 is also used in somatic 
editing, which is useful in situations where 
whole-body knockouts are lethal for embryos 
and in many cases can more accurately model 
cancer progression and realistic modes of 
therapeutic treatment (42). Advancements 
in in vivo delivery strategies (discussed later in 
this review) have expanded the types of somatic 
animal models that can be created. Through 
CRISPR-genome engineering methods, animal 
models have been developed for many diseases, 
including tyrosinemia, Duchenne muscular dys- 
trophy, cancer, osteoporosis, Huntington’s dis- 
ease, amyotrophic lateral sclerosis, Alzheimer’s 
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disease, and HIV-1/AIDS as just a few examples 
[reviewed in (38)]. The ability to rapidly create 
animal models has advanced and will continue 
to advance the study of genetic diseases, allowing 
researchers to study the causal relationships of 
specific genetic variations and disease and to de- 
velop and test new treatments for these diseases. 


CRISPR screens 


With the ease of introducing CRISPR-induced 
gene knockouts, researchers have successfully 
applied this technology to genetic screens, i.e., 
the systematic, targeted genetic alteration of a 
number of genes in parallel. Such CRISPR 
screens have become a powerful approach for 
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understanding genetic interactions and dis- 
secting biological pathways and have given 
rise to major advances in target discovery and 
drug development. The capabilities of CRISPR 
screens are continuing to expand, especially 
when combined with advancements in single- 
cell multiomics technologies. In general, genetic 
screens involve one or multiple gene perturba- 
tions, a model system such as engineered human 
cells, and a selection assay or readout to evaluate 
the effects of the perturbation(s) (43, 44) (Fig. 
2C). Because of its efficiency and flexibility, 
CRISPR editing is a powerful strategy for intro- 
ducing perturbations that can be used for close 
study of how a single gene disruption affects a 
cell of interest, as well as high-throughput testing 
of thousands of perturbations in pooled screens 
(45, 46). The ease of designing and cloning guide 
RNA (gRNAs) has enabled the development 
of gRNA libraries up to genome-wide scales, 
allowing researchers to perturb every gene in 
the human genome (47, 48). CRISPR technol- 
ogy advancements have also expanded the 
types of CRISPR screens that researchers can 
use for different applications. Beyond CRISPR 
KO screens, CRISPR interference (CRISPRi) and 
CRISPR activation (CRISPRa) screens have 
also become popular approaches that use re- 
versible gene expression control [reviewed in 
(49, 50)]. Saturation genome editing utilizing 
Cas9-mediated HDR enables the generation of 
all possible single-nucleotide polymorphisms 
(SNPs) for functional screening (57-53). More 
recently, as alternatives to Cas9-mediated HDR, 
researchers have also started applying CRISPR 
base and prime editing (discussed later in the 
Review) for genetic screens (54-56). Base edit- 
ing, which can introduce point mutations more 
efficiently than Cas9-mediated HDR with mini- 
mal indel formation, may serve as an improved 
strategy for functional variant screening (54). 
Prime editing allows for the introduction of 
small insertions and deletions in addition to 
point mutations, which essentially enables sat- 
uration mutagenesis across residues. This tech- 
nology is still relatively new, and it remains to be 
seen whether prime editing screens can achieve 
similar levels of flexibility as HDR with re- 
gard to target selection and targetable win- 
dow size (56). 

The successful editing of diverse cells and 
organisms using CRISPR technologies pro- 
vides flexibility for choosing a model system 
for the genetic screen to best answer the rele- 
vant biological question. Beyond primary cells, 
CRISPR screens have been developed in more 
complex model systems including organoids, 
animals, and plants (57-60). CRISPR screens 
are now a common method for probing gene 
function in cancer and have allowed for iden- 
tification of a variety of cancer drivers and 
regulators [reviewed in (45, 46)]. 

Following the introduction of CRISPR com- 
ponents into the model, a variety of techniques 
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can be used for the selection assay and readout. 
Common selection strategies include viability- 
or proliferation-based screens, fluorescence- 
activated cell sorting or microfluidics-assisted 
cell screening-based screens using cell surface 
proteins as markers (i.e., PD1, PDL1, MHC), 
and in vivo screens assaying phenotypes, such 
as tumor growth or sensitivity/resistance to 
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immunotherapy (45, 46, 67). One exciting area of 
development in the readout of CRISPR screens 
is new methods that provide simultaneous 
proteomic, epigenetic, and/or transcriptomic 
analyses, such as Procode (62), Perturb-ATAC 
(63), Perturb-seq (64), and ECCITE-seq (65), 
which can provide a wealth of information. 
The successes of CRISPR screens will continue 
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to accelerate with new technologies that im- 
prove the sensitivity of these assays and read- 
outs. We are only beginning to see the impact 
of combining CRISPR screens and single-cell 
multiomics modalities with the rapidly ad- 
vancing infrastructure for big data collection 
and analysis (64, 66, 67). Further discovery 
and engineering, including the enhancement 
of orthologous Cas9 enzymes or other RNA- 
guided nucleases such as Cas12a [reviewed in 
(7)], can also greatly increase the potential of 
combinatorial or multiplex CRISPR screens, 
which can reveal novel and complex genetic 
interactions. 


Multiplexed editing in plants and beyond 


Multiplex genome editing, or the simultane- 
ous targeting of multiple specific DNA loci in 
a genome, represents another area where 
CRISPR-induced gene knockout technology 
has been scaled up and adapted into a suc- 
cessful platform—particularly in the plant sci- 
ence fields [reviewed in (68, 69)] (Fig. 2D). Over 
the past decade, CRISPR-Cas9 has become a 
popular tool for plant editing. Traditional crop 
trait engineering methods involved random 
mutagenesis (e.g., with radiation) or trans- 
genesis with Agrobacterium followed by labo- 
rious crossing and screening to identify a plant 
with a new trait of interest. These processes 
are lengthy and hard to control in addition 
to facing substantial regulatory hurdles. By 
contrast, CRISPR-induced modifications are 
targeted, can be made rapidly, and generally 
represent small indels, i.e., insertions and/or 
deletions, or point mutations at locations spec- 
ified by the trait engineer [reviewed in (70)]. 
CRISPR-Cas9 has been adapted for simulta- 
neous multilocus editing, which is especially 
useful for editing crop species that can carry 
multiple copies of the target gene (e.g., hexa- 
ploid wheat) (77) and for crop domestication, 
which involves targeting multiple different 
genes [reviewed in (72)]. One advantage of 
CRISPR-Cas9 in regards to multiplexed edit- 
ing is separation of the nuclease and gRNA, 
such that multiple gRNAs can be used with 
one Cas protein to edit different targets (23). 
The gRNAs can be provided as multiple indi- 
vidual expression cassettes, each transcribed 
from their own promoters (73-75), or as a 
single polycistronic cassette that is processed 
posttranscriptionally (76-78). In the past few 
years, CRISPR multiplex genome editing has 
become a successful strategy for creating new 
crop genotypes and agriculturally useful traits 
in a single generation. One area where multi- 
plexed CRISPR-Cas9 editing has achieved suc- 
cessful results is in crop domestication and 
improvement. Examples include the use of 
multiplexed editing to disrupt domestication 
genes, introduce characteristics such as her- 
bicide resistance, and increase crop yield and 
quality (68, 79-81). 
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potential biological obstacles that can prevent the gene editor from reaching its target site. 


Beyond its success in plants, multiplexed 
CRISPR-Cas9 editing has been extended to other 
cell types and organisms. One notable example 
is the use of multiplexed editing to generate 
porcine endogenous retrovirus-inactivated pigs, 
addressing a major safety concern of transplant- 
ing pig organs into humans (82). Multiplexed 
editing has the potential to be particularly use- 
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ful in engineering cell therapy products for 
cancer and studying the effects of complex 
polygenic diseases. However, a challenge of 
CRISPR-Cas9 editing in mammalian cells that 
is exacerbated by simultaneous DNA cleavages 
in multiplexed editing is the possibility of trig- 
gering DNA damage-response mechanisms 
governed by transcription factor p53 that can 
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lead to cellular senescence or apoptosis (82-84). 
Addressing this challenge and extending the 
applications of multiplexed editing may be- 
come easier with the advancement of newer 
CRISPR precision editing technologies that 
do not involve DSBs, including base editing 
and prime editing (discussed further below), 
which have already been adapted for multi- 
plexed editing (85, 86). 


Site-specific modifications using base editing 


Beyond traditional CRISPR-induced knock- 
outs involving double-stranded DNA cutting, 
base editing (77)—which uses Cas effectors fused 
to enzymes that alter the chemical nature of 
DNA bases—has provided a successful strategy 
to generate site-specific and precise point mu- 
tations without DSBs, eliminating the need for 
repair templates and limiting undesired by- 
products during editing (Fig. 2E). CRISPR base 
editors generally consist of fusions between 
a Cas9 nickase (nCas9), a Cas9 variant that 
produces a single-stranded rather than a double- 
stranded break (or a catalytically inactive or 
“dead” Cas protein, such as dCas9, dCas12a, or 
dCas13b), and an enzyme that catalyzes a nu- 
cleobase deamination reaction [reviewed in 
(87, 88)]. The sgRNA directs the nCas9-deaminase 
fusion to the genomic target, where ternary 
complex formation exposes a region of ssDNA 
to the deaminase for chemical modification. 
The resulting base mismatch is then resolved 
through cellular repair mechanisms. Over the 
past few years, the toolbox of DNA and RNA 
base editors has expanded to enable C>T, A>G, 
C>G, A>I, and C>U conversions (JO, II, 89-93), 
though there is still need for further improve- 
ment, especially for C>G editing. Site-specific 
modifications broaden the abilities of researchers 
to study the effects of mutations within genes 
and can treat genetic disorders by correcting 
point mutations, which represent the largest 
class of human pathogenic genetic variants 
(94-97). Furthermore, base editing can intro- 
duce modifications in dividing and nondividing 
cells, providing an advantage over HDR, which 
is restricted to dividing cells (98). Base editing 
has already shown promising results in cor- 
recting loss-of-function mutations in a num- 
ber of mouse models [reviewed in (87, 88)], 
with a notable example being the recent use 
of in vivo base editing to correct Hutchinson- 
Gilford progeria syndrome in mice (99). By 
eliminating the introduction of DSBs, the 
development of base editing technology rep- 
resents an important step in precision editing. 
Refining these tools to the levels of accuracy 
and precision needed for treatments in humans 
will be one of the main challenges in the next 
decade. An early-stage clinical trial using base 
editing for familial hypercholesterolemia has 
already begun, and others using base edit- 
ing for sickle cell disease are set to begin this 
year (100). 
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Editing accuracy and precision 

As we transition into the next decade of CRISPR 
genome editing, several key challenges demand 
innovative solutions. Two of these are editing 
accuracy (i.e., specificity for the target site) and 
precision (i.e., producing the exact desired 
editing outcome) (Fig. 3A). To reduce the off- 
target effects of CRISPR-Cas nucleases result- 
ing from unintended binding and cleavage, 
researchers have utilized a combination of ra- 
tional design and selection to develop high- 
fidelity Cas variants [such as SpCas9-HF1 (107), 
evoCas9 (102), HiFiCas9 (103)] and the Cas9_ 
R63A/Q768A variant (104) and guide optimi- 
zation methods [such as E-Crisp (105, 106), 
CasOFFinder (707), and sgDesigner (J08)]. 
These efforts have been productive: Neither the 
CRISPR Therapeutics/Vertex nor the Intellia 
sgRNAs used in clinics today have measur- 
able off-target sites using US Food and Drug 
Administration (FDA)-grade assays (109, 110). 
However, off-target editing inaccuracies can 
also occur as a result of the Cas9-independent 
behavior of effector domains including deam- 
inases, reverse transcriptases, and transcrip- 
tional regulators, as exemplified by analysis 
of base editing outcomes (J1/, 112). Progress 
is currently being made through use of high- 
fidelity Cas variants and rational engineering 
of the deaminase domain to reduce nucleic 
acid binding without Cas assistance (11-115), 
and early-stage clinical trials of base editing 
(31, 116) offer encouragement in this regard. At 
the same time, innovating new methods to 
deliver Cas editors to the target site and refin- 
ing existing ones (described further below) can 
also minimize off-target effects. Editing pre- 
cision poses a larger challenge. In traditional 
CRISPR-Cas9 editing in eukaryotic cells, the 
scientist still does not fully control the editing 
outcome following the introduction of the 
DSB. Recent machine learning tools have been 
developed to help predict repair outcomes, al- 
though these have yet to be demonstrated for 
in vivo applications (117-119). Following the 
DSB, NHEJ—the default repair pathway for 
human cells—competes with the less-efficient 
HDR pathway and results in a spectrum of 
indels at the target site (720, 127). Although this 
may be acceptable for a number of CRISPR- 
induced knockout applications (including clin- 
ical), many therapeutic applications require 
much higher levels of precision and cannot 
afford undesired indels. Increasing editing 
precision requires a better understanding of 
DNA repair processes and a combination of 
innovation and engineering. One approach is 
improving HDR efficiency and/or suppressing 
NHEJ. Strategies that have been developed in- 
clude chemically inhibiting key enzymes in the 
NHEJ pathway, using single-stranded oligo- 
deoxynucleotide templates (which have been 
shown to increase HDR efficiency to 60% in 
human cells for a single-nucleotide substitu- 


Wang and Doudna, Science 379, eadd8643 (2023) 


tion) (722), utilizing cell cycle stage control to 
favor HDR repair (123-125) and using site- 
specific Cas9-oligonucleotide conjugates to 
recruit the donor DNA template to the target 
site (126). Even with these strategies, there 
are still risks of large deletions and chromo- 
somal rearrangements associated with DSB 
formation that can lead to genome instabil- 
ity (127, 128). Base editing and prime editing 
(addressed further below) represent another 
approach for precision editing intended to 
avoid DSB formation. Base editing and prime 
editing have reduced indel formation com- 
pared with classic Cas9-mediated editing. How- 
ever, in some cases, unintentional DSBs can still 
form at the editing site and lead to indels. It has 
been demonstrated that fusing base editors to 
Gam —a bacteriophage Mu protein that binds 
DSBs—can minimize indel formation during 
base editing (129). For base editing, editing 
precision is also challenged by bystander edit- 
ing (or undesired conversions of neighboring 
editable bases within or near the editing win- 
dow in addition to the target base) (87, 88). The 
correction precision of base editors decreases 
by a large margin when there is more than one 
target base in the editing window, limiting 
their therapeutic potential (730) (Fig. 3B). In 
a recent study, about half of the pathogenic 
single-nucleotide variants (SNVs) correctable 
by adenine base editors revealed =>50% correc- 
tion precision (130) (Fig. 3B). However, of the 
subset of SNVs containing more than one 
target base in the editing window, only 26% 
revealed =50% correction precision (Fig. 3B). 
Unfortunately, with current base editors, by- 
standers are a fairly common occurrence. In a 
recent study of 21 different base editing sys- 
tems, about half of the targetable pathogenic 
point mutations had bystanders in the editing 
window (737). Reducing the size of the editing 
window can increase precision; however, this 
also limits the genomic sites that can be tar- 
geted because of PAM constraints. A variety of 
strategies involving structure-guided mutagen- 
esis, directed evolution, and computational- 
aided design are currently being employed to 
increase the targeting scope of CRISPR-Cas9 
and reduce the bystander effects of base editors 
(132-134). For base editing to be an effective 
strategy for a wider breadth of applications, fur- 
ther engineering is required to build on current 
strategies to develop base editors with narrower 
editing windows and different PAM compatibil- 
ities without compromising on efficiency and 
targeting specificity. 


Genetic sequence insertions 


In recent years, emerging technologies are 
expanding the functional capabilities of the 
CRISPR toolbox to make precise programma- 
ble genetic sequence insertions, and an im- 
portant challenge in the next decade will be 
refining and effectively implementing these 
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technologies for genome engineering appli- 
cations. Traditional Cas9 editing can introduce 
transgenes by relying on HDR to incorporate 
genetic material from a co-delivered donor 
template into the target site (Fig. 3C) (27). 
Currently, this approach is being widely used 
in many areas of genome engineering. A no- 
table recent example of its application is the 
use of targeted integration to fluorescently 
tag more than 1000 human proteins to study 
their localization and interactions (135). HDR- 
mediated CRISPR-Cas9 editing has also shown 
promising results in preclinical and clinical 
testing for therapeutic development, with key 
examples in correcting alphal antitrypsin de- 
ficiency (736) and in cancer immunotherapy 
(137, 138). Despite these successes, HDR-mediated 
CRISPR-Cas9 editing has its limitations, includ- 
ing being restricted to dividing cells (123), the 
difficulty of donor template delivery, and the 
precision-related challenges introduced by 
the DSB. Although certain single-nucleotide 
mutations can be addressed by base editing, 
many human pathogenic genetic variants re- 
quire a small sequence insertion to repair an 
indel, calling for high-precision alternatives 
to HDR-mediated CRISPR-Cas9. 

Prime editing represents one such alterna- 
tive that can insert and delete DNA sequences 
without introducing DSBs (12), though this 
technology still needs further refinement (Fig. 
3C). Prime editors consist of nCas9 fused to a 
reverse transcriptase (RT) and a prime edit- 
ing gRNA (pegRNA) that serves both to direct 
nCas9 to the target site and act as a template 
containing the desired edit for the RT (72). 
Unlike HDR, prime editing can introduce mod- 
ifications in both dividing and nondividing 
cells, which is useful for correcting mutations 
in quiescent cells, such as neurons or hemato- 
poietic stem cells (739). Prime editing also pro- 
vides advantages over base editing in situations 
where there are multiple target bases in the 
editing window (132) and where a PAM se- 
quence is not immediately adjacent to the 
desired editing site (140). Currently, prime edit- 
ing has shown promise as an accurate and fairly 
precise editing tool that has been demonstrated 
to work in multiple cell types, organoids, mouse 
embryos, and plants but is still limited in its 
applications as a result of low editing effici- 
ency [reviewed in (139, 14D]. In two separate 
demonstrations of prime editing in organoids 
and mice, there were no detectable off-target 
edits (142, 143). Although low levels of undesired 
indel formation have been reported, the ratio of 
correct editing to indel formation was ~30 times 
higher for prime editing than for HDR (72, 142). 
Unfortunately, prime editing efficiencies are 
low for many applications. In one study, prime 
editing was more than 30 times less efficient 
than HDR in repairing a mutation in intesti- 
nal CF organoids (/44). Although more effi- 
cient prime editors have been developed, these 
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also result in higher rates of indel formation 
(72). Currently, base editors still have an ad- 
vantage over prime editors in editing effici- 
ency and precision (139). A main goal for prime 
editing in the next decade is improving effi- 
ciency without compromising editing product 
purity—an outcome that has the potential to 
turn prime editing into one of the most ver- 
satile tools for precision editing. Future studies 
should also address remaining uncertainties 
about the mechanism of prime editing. Recent 
results show that physical untethering of Cas9 
and the RT has no effect on prime editing lev- 
els in cells, suggesting that the RT could en- 
gage the editing site without being fused to 
Cas9 and raising questions about whether it 
could induce unintended integration at other 
RNA-DNA hybrid sites (73). Refining prime 
editing tools will require engineering and op- 
timizing the different constituent components, 
including the pegRNAs (145). 

For large gene insertions, an emerging area in 
CRISPR genome engineering is RNA-guided 
DNA transposition. CRISPR-associated trans- 
posons (CASTs) enable the precise RNA-guided 
integration of large DNA cargo up to 10 kb 
(146-149). So far, this has only been demon- 
strated in a few prokaryotes and has not yet 
been reported to work in mammalian cells 
(146, 147, 150, 151). There is potential for new 
developments as this area is still in the early 
stages of research with limited mechanistic 
understanding of how these systems work 
(15), and very few computationally predicted 
CAST systems have been characterized (149). 
Further discovery, testing, and engineering 
will be required to harness the potential of 
CASTs for genome engineering applications. 

Recombinases, which perform a wide array 
of activities, including insertions, deletions, 
inversions, and replacements (152, 153), are 
another area of tool development with the 
potential to combine with Cas proteins and 
may be able to further diversify the CRISPR 
toolbox (154). This has recently been dem- 
onstrated with the development of two new 
approaches that have enabled programmable 
integration of large DNA sequences in human 
cells (155, 156). One approach, programmable 
addition through site-specific targeting ele- 
ments, uses engineered fusion proteins of Cas9, 
a reverse transcriptase, and a serine integrase 
that have enabled multiplexed insertions of 
large DNA cargo, including the fluorescent 
tagging of different endogenous genes (155). 
Another approach uses twin prime editing, 
which involves a prime editor and two prime 
editing guide RNAs that have enabled large 
gene insertions and inversions when com- 
bined with a site-specific serine recombinase 
(156). This approach was used to correct a large 
sequence inversion associated with Hunter 
syndrome in human cells with up to ~9% ef- 
ficiency (156). Notably, these studies report no 
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detectable off-target insertions. These ap- 
proaches for programmable gene insertions 
will require further characterization and re- 
finement to increase editing efficiencies and 
to serve as potential therapeutic strategies. 
The potential impact of programmable gene 
insertion for genome engineering will con- 
tinue to motivate discovery and innovation 
in search of new strategies in addition to im- 
proving existing technologies. 


Delivery of editors ex vivo and in vivo 


Despite all of the recent advances in CRISPR 
editors, delivery of editors remains a major 
bottleneck for genome editing in organisms; 
both innovation and engineering are needed 
to ensure high delivery efficiency, target spec- 
ificity, and safety. Advances in delivery tech- 
nologies have played a large role in developing 
CRISPR-based therapeutics. The liver repre- 
sents a clear example where efficient delivery 
of CRISPR editors has been a clinically trac- 
table challenge (70). However, for less accessible 
organs, the feasibility of CRISPR therapeutics 
is limited by low delivery efficiencies and will 
largely depend on improved delivery strat- 
egies. Current delivery strategies for potential 
CRISPR-based treatments in humans are di- 
vided into two types of approaches: ex vivo, 
where cells are isolated from and modified out- 
side of the patient before being reintroduced, 
and in vivo, where cells are edited directly in 
the patient following delivery of CRISPR com- 
ponents [reviewed in (157-160)]. Ex vivo ap- 


Box 1. Cost, regulation, and access. 


proaches, often used for editing hematopoietic 
stem and progenitor cells and leukocytes, offer 
higher cell-type specificity and tighter qual- 
ity control of editing; however, they are limited 
to cell types that can survive and be expanded 
in culture (to achieve a minimum number for 
reengraftment) and retain in vivo function. 
In vivo approaches expand CRISPR editing to 
cell types where ex vivo approaches are not 
possible, allowing CRISPR to treat a wider range 
of genetic diseases. Two notable examples 
where in vivo delivery has had some success in 
humans are the treatment of transthyretin 
amyloidosis, which represents the first systemic 
in vivo delivery of CRISPR to the liver using 
targeted lipid nanoparticle (LNP) delivery (770), 
and treatment of Leber congenital amaurosis 
type 10, which involves direct injection of an 
adeno-associated viral vector harboring the 
RNA-guided enzyme into the eye (J6/, 162). 

These successes show the tremendous po- 
tential of in vivo therapeutic genome editing; 
however, in general, in vivo delivery of CRISPR 
editors remains a formidable challenge. Many 
biological obstacles stand in the way of ef- 
fective in vivo delivery of editors to targets. In 
the case of systemic delivery, delivery vehicles 
need to prevent degradation of the cargo, op- 
sonization, and phagocytosis extravasate from 
the blood vessel; pass efficiently through the 
interstitial space; and effectively release cargo 
upon endocytosis (Fig. 3D). Once the CRISPR 
cargo is released, it also needs to localize to 
the nucleus and access the target locus in the 


With the growing therapeutic potential of CRISPR technologies, other important considerations include 
affordability, regulation, and access. For a more in-depth examination of the ethical and societal chal- 
lenges of these technologies, we refer readers to refs. (196, 197). One main challenge to the development 
and accessibility of CRISPR therapeutics is cost. In many cases, manufacturing costs encompass the 
expenses of producing both the CRISPR editors and the delivery vehicles, which can be difficult to scale 
up. For instance, viral-based delivery is a popular strategy for developing CRISPR therapeutics, but the 
manufacturing of viral vectors depends on expensive culture systems and facilities to produce required 
quantities of virus (175). Improving process development and providing the infrastructure to scale up the 
production of viral vectors will be important to reduce costs. Treatment administration can also be expensive, 
especially for ex vivo approaches, which consume time and resources to expand cells in culture and, in cases 
of bone marrow transplantation with autologous hematopoietic stem cells, require preconditioning patients 


with chemotherapy (198). 


Manufacturers also face the burden of regulatory costs to provide extensive characterization and strict 
safety and quality controls, which can be challenging for investigational or academic manufacturing 
facilities. As increasing numbers of CRISPR-based treatments move to later stages of clinical trials, 
manufacturers will need to build the infrastructure and bear the costs to support a current good man- 
ufacturing practice-compliant operation. The challenges of bearing these costs and getting FDA approval 
can lead to abandonment of the therapeutic development in a for-profit setting, as was recently the case 
for a gene therapy for adenosine deaminase severe combined immunodeficiency (ADA-SCID) that had 
promising long-term results (199, 200). Even if a treatment passes through all clinical trial phases and 
gets FDA approval, the potential retail price charged to cover manufacturing costs may be unaffordable 
to most patients without changes to the current health care infrastructure. Although many of the costs 
associated with developing new therapies are unavoidable, the motivation to make future CRISPR 
therapeutics widely accessible to those who need them will drive innovation of more efficient and cost- 
effective strategies for large-scale production that can meet regulatory standards. 
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chromosome. Each of these intermediary steps 
or requirements between the initial intra- 
venous injection and the actual editing of 
the target locus has its own set of challenges 
that will necessitate both engineering and 
innovation to overcome. One of these is con- 
trolling delivery vehicle size (often restricted 
by cargo size), which may present a challenge 
for bypassing the vascular endothelium and 
interstitial space between blood vessels and 
target cells (757). This challenge has motivated 
efforts to engineer and discover smaller CRISPR 
editors that can be delivered more efficiently 
(163-167). Another challenge is preventing up- 
take and editing in off-target cells; this could 
be addressed by utilizing a targeting mole- 
cule, such as a single-chain variable fragment 
or glycoprotein, through conjugation to the 
CRISPR RNP (68). Other approaches include 
engineering or evolving the delivery vehicle to 
target specific cells (760). An alternative strat- 
egy to systemic delivery is direct injection to a 
specific tissue, which largely avoids editing 
other unintended tissues and organs. How- 
ever, direct injection results in genome editing 
of relatively few cells within a localized space 
and is practical only for organs directly acces- 
sible to such injection (J6/, 169). 

In these various approaches, the cargo is 
delivered as one of three forms: a plasmid DNA 
(>6400 kDa for a 10-kb plasmid DNA) encoding 
the CRISPR-Cas9 and gRNA (either together 
or separately), the Cas9 mRNA (1400 kDa) 
and gRNA (34 kDa), or the Cas9-gRNA RNP 
(194 kDa) [reviewed in (158, 170)]. DNA cargo 
is relatively stable compared with RNA or 
protein, but its delivery results in the slowest 
initiation of editing and offers lower control 
over the functional RNP concentration in the 
system at any given time. In some cases, DNA 
cargo can permit prolonged expression of 
Cas9, but this increases the probability of 
off-target effects and immunogenic reactions 
(170, 171). Of the three, RNP delivery results in 
the fastest initiation of editing and generally 
results in lower off-target effects, but methods 
for delivering RNPs are still limited. 

Currently, a variety of delivery methods exist 
for CRISPR gene editing in mammalian sys- 
tems, though each has its own set of challenges 
and limitations. These are broadly divided 
into physical delivery, viral-based delivery, and 
synthetic material-based delivery [reviewed 
in (158, 159)]. Common physical delivery ap- 
proaches include microinjection and electro- 
poration, which can deliver CRISPR editors in 
all three forms of cargo (170). These methods 
allow for controlled dosage and high-efficiency 
delivery; however, both methods are effec- 
tively limited to ex vivo delivery. Viral-based 
delivery methods include adeno-associated 
viruses (AAVs), adenoviruses (AdVs), and lenti- 
viruses, which deliver CRISPR cargo in the 
form of plasmid DNA and offer high delivery 
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efficiencies by harnessing delivery vehicles 
that viruses have had thousands of years to 
evolve. Of these, AAVs are the most promising 
for in vivo clinical use for CRISPR therapeutics, 
with notable examples being an ongoing clin- 
ical trial for Leber congenital amaurosis type 10 
(1671), a soon-to-begin trial for HIV (172), and 
advanced preclinical work for Hutchinson- 
Gilford progeria syndrome (99). However, one 
main limitation of AAVs is their low packag- 
ing capacity (173). Compared with AAVs, AdVs 
and lentiviruses offer higher packaging effi- 
ciencies and have also been used for CRISPR- 
Cas9 delivery but have faced other challenges, 
including immunogenicity concerns [reviewed 
in (157, 159, 174)]. Another important consid- 
eration for viral-based delivery is the high cost 
and labor-intensive production, especially for 
AAVs and AdVs (175), which represents a ma- 
jor challenge for large-scale manufacturing 
and patient treatment (Box 1). Compared with 
viral-based delivery, synthetic material-based 
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delivery methods, which include LNPs (176), 
cationic polymers and peptides (177-181), and 
gold nanoparticles (182), are often safer and 
offer high levels of control and flexibility be- 
cause they can be more easily tailored to all 
three forms of cargo (DNA, mRNA, and RNP) 
and be optimized for immunocompatibility 
(157). Notably, as mentioned above, LNPs are 
the delivery strategy used in the first systemic 
in vivo delivery of CRISPR in humans for the 
successful treatment of transthyretin amyloid- 
osis (110). In general, however, these syn- 
thetic material-based delivery methods offer 
lower delivery efficiencies compared with 
viral-based methods (159), limiting their effec- 
tiveness for in vivo delivery to less accessible 
target organs. Although improvements can be 
made with further optimization, their maxi- 
mum efficiency may be limited by the mate- 
rials’ bulky size and cationic nature, which 
result in poor interstitial dispersion. Recently, 
extracellular vesicles and virus-like particles 
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Fig. 4. Future directions: where we could be 10 years from now. The future of CRISPR genome editing will 
intersect with advances in technologies such as machine learning, live-cell imaging, and sequencing. In the near 
future, we may witness FDA approval for the first CRISPR-based medicine, as well as increasing numbers of 
CRISPR treatments moving to later stages of clinical trials and approval of new clinical trials using improved in vivo 
delivery methods. We expect approval of more CRISPR-edited crops for sale and more demonstrations of CRISPR 
used to engineer multigenic traits in plants and animals. In the more distant future, we may one day see many 
widely accessible CRISPR-based treatments and even use genome editing to safely harvest pig organs for 
transplant patients or as a prophylactic against disease. In agriculture, CRISPR may be routinely used to generate 
disease-resistant, high-yield crops to increase global food supply and security. 
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(VLPs) (assemblies of the viral envelope and/or 
structural proteins that can transduce cells but 
lack viral genetic material) are emerging as 
promising delivery platforms that utilize the 
strengths from both viral-based delivery and 
synthetic material-based delivery (183-186). 
These have the potential to achieve the high- 
delivery efficiencies of viral-based methods 
without the safety concerns of random trans- 
gene integration or prolonged expression of 
the editors. One especially exciting area of de- 
velopment is programming the cellular tropism 
of VLPs by using different envelope glycopro- 
teins to target specific cell types, as recently 
demonstrated for ex vivo and in vivo delivery 
and editing (185, 186). The future of CRISPR 
treatments in humans will largely depend on 
improving current delivery strategies, innovat- 
ing new delivery modalities, discovering and 
engineering more compact CRISPR editors, or 
a combination of the above. 

Beyond mammalian systems, advances are 
also being made for CRISPR reagent delivery 
in plants. The plant’s cell wall, which has a size 
exclusion limit of 5 to 20 nm, presents a major 
challenge for cargo to pass through (187). The 
two predominant methods are delivery of plas- 
mid DNA using agrobacterium, which inte- 
grates the transfer DNA into the plant genome, 
and particle bombardment, which physically 
breaches the cell wall barrier to introduce the 
cargo [reviewed in (187)]. The main drawback 
of these methods is random integration of the 
CRISPR cassette into the plant genome. The 
goal of enabling transgene-free breeding has 
motivated alternative approaches, including 
direct RNP delivery by polyethylene glycol- 
mediated cell transfection, particle bombard- 
ment, electroporation, or lipofection [reviewed 
in (788)]. Though these methods hold promise, 
the broad application of RNPs in many plant 
species requires further work in improving 
delivery efficiency and plant regeneration from 
edited protoplasts. 


Current and future applications 


The advent of programmable genome editing 
technologies has paved the way for appli- 
cations of cell and gene therapy to treat and 
even cure disease. Although applications of 
CRISPR are too numerous to list comprehen- 
sively, the treatment of sickle cell disease (SCD) 
provides an excellent example of the value and 
risks ahead. There are now at least eight FDA- 
approved clinical trials of CRISPR-based ther- 
apies for SCD and related blood disorders that 
are ongoing or soon to begin, and FDA ap- 
proval of the first is expected in 2023 (189). 
However, the challenges to widespread deploy- 
ment of CRISPR cures for SCD remain for- 
midable. For genome editing to become a 
standard of care, the field will need to address 
the difficulty in manufacturing edited cells for 
each individual patient, the logistical limita- 
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tions conferred by the need for bone marrow 
transplantation, and the cost, which can run 
up to $2,000,000 per patient (190). But suppose 
a new delivery modality for genome editors 
obviates the need for both ex vivo cell editing 
and bone marrow transplantation? Such an 
advance would be game changing, propelling 
the field into a new era where much broader 
deployment of genomic therapies would be 
possible. 

Beyond clinical applications, CRISPR is be- 
ginning to have impacts in agriculture and 
animal husbandry. CRISPR-edited foods are 
already beginning to enter the market. This 
includes a CRISPR-created tomato with en- 
hanced nutritional qualities and two CRISPR- 
edited fish (a faster-growing tiger puffer and a 
red sea bream with greater edible yield), which 
have been approved for sale in Japan (197, 192). 
Among these agronomic applications are many 
examples where CRISPR has enabled the pre- 
cise “transfer” of small genetic changes that 
confer desirable traits from one variety of a 
species to another—a transfer that is either 
impossible or impractical using any other 
approach. Beyond small perturbations, CRISPR 
has also shown potential to generate new ge- 
netic variation and complex editing previously 
not seen in nature. A key example is the recent 
use of multiplexed editing to simultaneously 
knock out and activate different genes to intro- 
duce disease resistance in wheat and restore 
growth and yield (193). These constitute just the 
beginning of a wide range of genome editing 
advances that will increasingly affect our lives in 
the coming years. 

CRISPR-Cas9, part of a bacterial immune 
system, uses an RNA-guided mechanism to 
recognize and cut DNA sequences. This fun- 
damental biochemical activity forms the basis 
for genome editing technology that spans all 
realms of basic and applied biological re- 
search, ranging from developmental biology 
and plant genetics to sickle cell disease and 
animal husbandry. The discovery of new 
CRISPR-based and CRISPR-related enzymes 
has accelerated rapidly, leading to continued 
expansion of understanding about the natural 
biology of these systems in microbes and their 
utility for genome editing in other cells and 
organisms. CRISPR-Cas9 and CRISPR-Cas12a, 
the most widely adopted genome editing en- 
zymes, have become workhorses in research 
laboratories worldwide. Fundamental research 
enabled by genome editing speaks to the cross- 
cutting nature of CRISPR technology and to 
the timeliness of its arrival as a readily adapt- 
able tool. These wide-ranging applications 
have in turn enabled expansion of the CRISPR 
toolbox to enable more precise editing of spe- 
cific nucleotides or the targeted integration of 
new genetic information. In the decade ahead, 
genome editing research and applications will 
continue to accelerate and will increasingly 
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intersect with technologies including machine 
learning, live cell imaging, and faster, cheaper 
DNA sequencing (Fig. 4). Just as the past de- 
cade has focused on CRISPR platforms, the 
decade ahead will increasingly apply those 
platforms for real-world impacts. In the clinic, 
we will undoubtedly see increased numbers 
and types of clinical trials, providing data that 
will guide next-generation gene and cell ther- 
apies. As clinical applications expand, there 
may be an opening for CRISPR to be used to 
protect health. For example, as safety and ef- 
ficacy are established for disease treatment, 
genome editing might become a prophylactic 
against neurodegenerative or cardiovascular 
disease (37). Such opportunities would require 
detailed knowledge of the genetics of multi- 
genic disease and the means to deliver to or- 
gans including the brain and heart—neither of 
which are small tasks. But the potential bene- 
fits may drive innovation in these areas well 
beyond what is possible today. In agriculture, 
CRISPR screening will provide increasing in- 
sights into paths to engineering multigenic 
traits in both plants and animals. Products gen- 
erated using CRISPR—whether pig organs 
for transplant patients (794), rice that resists 
drought with increased yield, or microbiomes 
fine-tuned for health using CRISPR editing— 
may all become routine. CRISPR also serves as 
a notable example of the connection between 
curiosity-driven research, innovation, and tech- 
nological breakthroughs. By continuing to ex- 
plore the natural world, we will discover what 
cannot be imagined and put it to real-world 
use for the benefit of the planet. 
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INTRODUCTION: Diffuse large B cell lymphomas 
(DLBCLs) are aggressive malignancies of which 
~40% fail to respond or become refractory to 
treatment. Mechanisms that mediate these 
poor outcomes are unknown. Most DLBCLs 
originate from specialized B cells participat- 
ing in the germinal center (GC) reaction. GCs 
are transient structures within which antigen- 
activated B cells undergo proliferative bursting 
and somatic hypermutation of their immuno- 
globulin genes to improve affinity against the 
encountered antigen. High proliferation activ- 
ity depletes GC B cells of essential biosynthetic 
precursors. Hence, GC B cells must endure an 
intense Darwinian competition process, during 
which they vigorously compete for access to 
positive selection signals from a limiting 
number of T follicular helper (Ty) cells. Only 
few B cells that win this competition receive 


Faster germinal center kinetics 


Aggressive 
lymphoma 


Try cell help, which triggers transient expres- 
sion of the Myc proto-oncogene and biosyn- 
thetic growth programs. Unfortunately, GC 
B cells acquire abundant off-target mutations, 
making them prone to malignant transfor- 
mation. Hence, GC B cell selection by Ty cells 
requires rigorous, but still largely unknown, 
control mechanisms to prevent unwanted clonal 
expansion and malignant transformation of 
abnormal cells. 


RATIONALE: Missense mutations of BTGI (B 
cell translocation gene 1) are specific to GC- 
derived B cell lymphomas, suggesting that their 
oncogenic function is cell-context specific. BTG 
mutations help to genetically define a class of 
DLBCLs that manifest especially poor clinical 
outcomes and extensive dissemination. This 
prompted us to explore how BTGI mutations 


Super- 
competition 


Supercompetition in germinal centers leads to aggressive B cell lymphomas. GC B cells vigorously 
compete against each other to receive limiting T cell help, which activates MYC-dependent growth 
programs in preparation for subsequent proliferative bursting. BTG1 mutant GC B cells respond to T cell 
help signals with faster MYC induction kinetics and more-rapid entry into the next proliferative phase, 
which, over many GC rounds, allows them to outcompete wild-type cells, leading to highly fit and 


disseminated B cell lymphomas. 
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contribute to the pathogenesis of these clin- 
ically challenging tumors. 


RESULTS: We generated mice for the B cell- 
conditional expression of the most frequent 
BTGI mutation [GIn*°—His (Q36H)]. Btg1o" 
cells almost completely outcompeted their 
wild-type counterparts, specifically in the GC. 
This competitive fitness manifested as a stron- 
ger induction of MYC-dependent growth pro- 
grams. Wild-type BTGI associated with many 
mRNAs, including those for MYC and MYC 
target genes, a function that was significantly 
lost by BTG12?™. Mutant BTG1 expression en- 
hanced MYC mRNA loading onto polysomes 
and induced slightly more-rapid MYC protein 
induction kinetics and a higher fraction of 
MYC-positive B cells in murine and human 
systems. These data suggested a lower thresh- 
old for BTG1 mutant cells to respond to Try- 
positive selection signals. Accordingly, Btgl 
mutant GC B cells experienced faster cell cycle 
S phase transit and earlier entry into sub- 
sequent proliferative bursts. In Bcl2-driven 
lymphoma mouse models, Btg1?**" markedly 
accelerated disease onset, shortened survival, 
and yielded particularly invasive DLBCL-like 
lymphomas. Btg!2?*" lymphomas were highly 
clonal and heavily mutated, reflecting a lym- 
phomagenesis trajectory with increased selec- 
tion and mutation rounds over time. In patients, 
BTGI mutations scored as strong genetic DLBCL 
drivers and independently associated with sig- 
nificantly inferior clinical outcomes. 


CONCLUSION: Our data suggest that BTG1 
serves as a critical gatekeeper controlling a 
key fitness checkpoint for natural selection of 
intensely competing B cells during the adap- 
tive immune response. The precise targeting of 
BTGI1 function by missense mutations causes 
a subtle biochemical impact, whose specific 
engagement during repetitive rounds of GC 
B cell positive selection likely explains its sub- 
stantial oncogenic fitness effect. This reveals 
the fragility of constraints required to prevent 
competing B cells from recalling features of 
uncontrolled natural selection among uni- 
cellular organisms and highlights a fine-tuned 
balance between protection against infection 
and cancer risk. This effect is reminiscent of 
the Myc-dependent supercompetition first 
described during Drosophila development. 
Future therapeutic strategies could potentially 
take advantage of such evolutionary vulner- 
abilities to target cancer cell fitness. 
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Multicellular life requires altruistic cooperation between cells. The adaptive immune system is a notable 
exception, wherein germinal center B cells compete vigorously for limiting positive selection signals. 
Studying primary human lymphomas and developing new mouse models, we found that mutations affecting 
BTGI disrupt a critical immune gatekeeper mechanism that strictly limits B cell fitness during antibody 
affinity maturation. This mechanism converted germinal center B cells into supercompetitors that rapidly 
outstrip their normal counterparts. This effect was conferred by a small shift in MYC protein induction 
kinetics but resulted in aggressive invasive lymphomas, which in humans are linked to dire clinical outcomes. 
Our findings reveal a delicate evolutionary trade-off between natural selection of B cells to provide immunity 


and potentially dangerous features that recall the more competitive nature of unicellular organisms. 


uring the humoral immune response, 

germinal center (GC) B cells undergo 

iterative rounds of natural selection to 

generate high-affinity B cell receptors. 

This process involves somatic hyper- 
mutation and clonal expansion of GC B cells 
and strictly depends on B cells competing to 
receive positive selection signals from T follic- 
ular helper (Ty) cells (7, 2). Ty cell help allows 
a few GC B cells to survive and differentiate 
into memory B cells or plasma cells, or instead 
undergo additional rounds of proliferative 
bursting and mutagenesis. Transient induc- 
tion of Myc- and mammalian target of ra- 
pamycin complex 1 (mTORC1)-biosynthetic 
programs is required to accumulate biomass 
in preparation for the subsequent prolifera- 
tive burst (3-5). 

The mutagenic and proliferative nature of 
GC B cells places them at considerable risk 
for malignant transformation (6, 7). Hence, 
most immune system neoplasms arise from B 
cells that have transited the GC reaction. The 


most common of these are diffuse large B cell 
lymphomas (DLBCLs), which have classically 
been separated into germinal center B-like 
(GCB) and activated B-like (ABC) DLBCLs ac- 
cording to their transcriptional profiles (8). 
DLBCLs were more recently segregated into 
genetically defined entities with distinct bio- 
logical characteristics and mutational profiles 
(9-11). Among these, the most clinically chal- 
lenging, MCD/cluster 5, is an ABC-DLBCL sub- 
type featuring extranodal dissemination and 
high fraction of proliferating cells. Recurrent 
somatic mutations in MYD88 and CD79B drive 
proliferation of these lymphomas through 
chronic activation of Toll-like receptor and 
B cell receptor signaling, whereas TBLIXR1 
mutations endow them with a post-GC mem- 
ory B cell phenotype (72, 13). However, the 
basis for their marked fitness and aggressive 
phenotypes remains unknown. 

Possible clues to these phenotypes may be 
gleaned from mechanistic deconvolution of 
other recurrent mutations. For example, so- 


matic missense mutations of BTGI (B cell trans- 
location gene 1) occur in up to 70% of MCD- 
DLBCL cases (17). BTG1 has been reported to 
regulate gene expression by interacting with 
transcription factors, mRNA stability regu- 
latory proteins (CNOTs), or the arginine methyl- 
transferase PRMT1 (/4-18). Although BTG1 
loss results in minor perturbations of early 
B cell development (19), and a few BTG1 mu- 
tants show differential CNOT interaction and 
tethered-mRNA stability effects in yeast and 
human embryonic kidney 293T (HEK293T) 
cells (20), the role of BTG1 in mature B cells 
and how BTG1 lymphoma mutations contrib- 
ute to lymphomagenesis remain unknown. 
Furthermore, BTGI is not deleted in DLBCLs, 
suggesting that mutations are not simple loss- 
of-function alleles (77). In this study, we probed 
the effect of BTGI mutation on GC biology 
and malignant transformation. BTGI muta- 
tion conferred a supercompetitive phenotype 
to GC B cells through a discrete acceleration of 
their response to Try; cell help, resulting in the 
formation of highly aggressive lymphomas. 
Our findings illustrate how subtle biochemical 
perturbations in GC B cells can confer marked 
competitiveness and oncogenic transforma- 
tion potential in concert with microenviron- 
mental cues such as Try cell help. 


Results 
BTG1 somatic mutations in DLBCLs are 
genetic drivers 


We surveyed the genomic profiles of 25,670 
cancer patients, including publicly available 
DLBCL datasets (9, 2/-29). BTG1 somatic mu- 
tations were detected in 11% of DLBCL cases 
(n = 272/2407), which is frequent given their 
high genetic heterogeneity, but were rare in 
other tumor types, suggesting specific BTG1 
function in this context (fig. SIA). Within 
DLBCLs, BTGI mutation was enriched in the 
more clinically unfavorable ABC-DLBCLs (P = 
0.0184; fig. S1B), especially those defined as 
MCD/cluster 5 lymphomas, with 40% of all 
BTGI1 mutant DLBCLs being MCD (fig. S1C). 
BTGI mutations scored among the top DLBCL 
drivers in a rigorous genetic driver analysis 
taking into account multiple genomic and 
epigenomic covariates (Fig. 1A and table S1). 
Most BTGI mutant alleles corresponded to 
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heterozygous missense mutations clustered 
within the N terminus, which contains sev- 
eral conserved functional motifs (30) (Fig. 1B 
and table S2). Alpha helix 2 (a2) was most 
heavily mutated, especially at Gln®°, which was 
most frequently (86%) replaced by a histidine 
[Gln*°—-His (Q36H); Fig. 1, B and C, and fig. 
SID]. Q36H and additional BTG] N-terminal 
mutations can alter the conformational dis- 
tribution of BTG1 within the 02-04 region, 
as determined by in silico atomistic molec- 
ular dynamics simulation (37). 


Btg1°°5" generates supercompetitor GC B cells 


DLBCLs originate from B cells having transited 
the GC reaction. To determine the functional 
impact of BTG/ mutation during the humoral 
immune response, we generated a mouse mod- 
el for conditional expression of mutant Btgl 
from the Rosa26 locus (R26'! 8219364) refer- 
red to hereafter as Q36H (fig. S2, A to C). We 
then reverted the Q36H mutation to wild type 
to generate a wild-type knock-in (WTKI) con- 
trol line (R26'!'2™. fig S2, D to G). Q36H 
and WTKI mice were crossed to the Cd19“° 
strain to induce Bte1@*™" or Btel”™! expres- 
sion in B cells (32). Recombination and expres- 
sion of Q36H and WTKI alleles was validated 
in B cells at DNA, RNA, and protein levels (fig. 
$2, H to M). Notably, the total abundance of 
Bitgi transcripts was not increased upon ex- 
pression of Btg12?°" or Btg1”™! from the 
Rosa26 locus, indicating an absence of Btg1 
dosage effect (fig. S2, J and K). Q36H or WTKI 
expression in B cells did not perturb early 
B cell development in the bone marrow nor 
steady-state mature B cell populations in sec- 
ondary lymphoid tissues and peritoneal cavity 
(fig. $3). 

To investigate the GC response, we immu- 
nized Q36H (R26! Bte1Q36H/+. cd19°’*), 
WTKI (R26! Btsiwt/+. Cq7g/*), and CRE- 
control (R26"/*; Cd19’*) or CREneg-control 
(R26 BislQs6H/+. Cq7g*/*) mice with sheep 
red blood cells (SRBCs), a T cell-dependent 
antigen, and analyzed spleens 10 days later. 
The GC response to SRBCs remained un- 
changed in Q36H and WTKI animals (fig. S4). 
Using a different T cell-dependent antigen 
[(4-hydroxy-3-nitrophenyl)acetyl (NP) conju- 
gated to ovalbumin (OVA)] to assess antigen- 
specific responding cells, we further observed 
no change in Q36H versus CRE mice over the 
course of the GC reaction (fig. S5). 

Most GC-derived lymphoma oncogenes af- 
fect GC size, composition, or output. Hence, 
the absence of such effects in the Btg1??°" 
setting was puzzling. However, these aspects 
do not necessarily reflect possible selective 
advantages within B cell clones. To directly 
test competitive fitness of Q36H GC B cells 
in vivo, we crossed our mice to the B1-8" al- 
lele. B1-8™ encodes a B cell receptor with high 
NP affinity in B cells with a lambda immuno- 
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Fig. 1. Characterization of BTG1 missense mutations in DLBCL. (A) Quantile-quantile plot of observed 
and expected P values from a gamma-Poisson distribution for single nucleotide variants across 101 germline- 
matched whole genome-sequenced DLBCLs. Mutated genes with FDR < 0.001 are depicted with a red 

dot. (B) BTG1 protein and frequency, type, and location of mutations in n = 272 unique cases out of n = 2407 
DLBCLs. LxxLL motifs mediate interaction with nuclear receptors. (©) BTG1 homology model (residues 1 
to 129) based on a structural reference of BTG2 (Protein Data Bank ID 3DJU) showing mutation frequency. 
N- and C-terminal ends are denoted. Single-letter abbreviations for the amino acid residues are as follows: 
A, Ala; C, Cys; D, Asp; E, Glu; F, Phe; G, Gly; H, His; |, lle; K, Lys; L, Leu; M, Met; N, Asn; P, Pro; Q, Gln; R, Arg; 


S, Ser; T, Thr; V, Val; W, Trp; and Y, Tyr. 


globulin light chain (~10 to 15% of murine 
B cells) that facilitates their entry into GCs 
upon NP immunization (33). We adoptive- 
ly transferred a 50:50 ratio of B1-8"/Q36H 
(B1-8h*”-, R26) Bs 1Q36H/+. cag"; CD45.1/ 
2) and B1-8"/CREneg-control (B1-8"'*/"; 
R26" BislQsoH/+. Cq7g*!/*; CD45.1) resting B cells 
into wild-type CD45.2 (non-B1-8") recip- 
ients (Fig. 2A). Q36H GC B cells showed a 
progressive competitive advantage over time, 
reaching up to ~90% of total GC B cells by day 
14 (Fig. 2A and fig. S6A). Q36H did not confer 
a competitive advantage to non-GC B cells (fig. 
S6A), and the GC fitness gain was not asso- 
ciated with changes in GC polarity (fig. S6B). A 
similar advantage of B1-8"'/Q36H GC B cells 
occurred when competing against B1-8"/ CRE- 
controls instead (B1-8"!*”"; R26*/*; Cdig*!*; 
CD45.1; fig. S6C). In contrast, B1-8"'/WTKI 
cells did not show a competitive advantage in 
GC B cells over non-GC B cells (fig. S6D). To 
confirm that the Q36H fitness advantage was 
not due to the extra R26"! copy or Cd19T° 
knock-in, we directly placed B1-8"/ Q36H and 
B1-8"!/WT KI cells in competition. Q36H again 
outcompeted WTKI cells in the GC compart- 
ment, whether gating or not on antigen- 
specific (lambda*NP") cells and even given 
a lower Q36H abundance in the starting NB or 
non-GC B compartment (Fig. 2B and fig. S7A). 
To assess Q36H GC B cell fitness in the context 
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of a polyclonal immune response (without 
B1-8"'-engineered antigen specificity), we 
transferred resting B cells from Q36H, WTKI, 
or CRE-controls into wild-type recipients and 
measured GC contribution of each genotype 
versus recipient cells. Q36H cells showed greater 
expansion within the GC compartment (18% 
average) as compared with WTKI and CRE 
at day 21 (~5%; Fig. 2C). Such expansion was 
similar in NP-specific GC B cells and did not 
occur in the NB compartment (fig. S7, B and 
C). In these experiments, Cd19“°-mediated 
recombination efficiency was as expected (32), 
and the Q36H GC B cell advantage was accom- 
panied by a measurable and significant en- 
richment of Q36H recombined alleles in GC 
B versus NB cells (fig. S7D). We next assessed 
proliferation in competing Q36H and CRE 
B cells in T cell-independent B cell activation 
by NP-Ficoll immunization. We observed sim- 
ilar proportion and carboxyfluorescein succi- 
nimidyl ester (CFSE) proliferation dye dilution 
of antigen-specific (lambda* NP*) Q36H and 
CREB cells over time (fig. S7E). Thus, Btg1?°" 
provided a competitive fitness advantage spe- 
cifically to GC B cells during T cell-dependent 
immune responses, independently of antigen 
affinity. Given the lack of GC-specific pheno- 
type in mice expressing Btg1”"™" in B cells, we 
focused subsequent functional experiments on 
mice expressing Btg12??", 
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Fig. 2. Btg1°°°4 generates supercompetitor 
GC B cells. (A) In vivo competitive assay 
schematic and representative flow cytometry 
plots for gating Q36H and CREneg from 
transferred germinal center (GC) B and non- 
GC B cells. Graph represents pooled mice 
from three independent experiments, each 
with n = 2 or 3 mice per time point. Mean + 
SD, paired t test (two-tailed). (B) Competitive 
assay between mature B cells expressing 
Btg1°°4 versus Btgl”” from Rosa26 (R26) 
knock-in alleles that differ by a single nucle- 
otide. Representative flow cytometry plots for 
gating antigen-specific (Ag-spe, lambda*NP*) 
naive B (NB, CD138°B220"lgD") and GC B 
(FAS*CD38°) WTKI and Q36H cells at day 14 
after immunization. Graph represents pooled 
mice from two independent experiments, 
epresentative of three [day 7 (d7) and d14] 
or two (d10) independent experiments, 

each with n = 3 to 5 mice per time point. 
ean + SD, paired t test (two-tailed). 
(C) Experimental schematic for the adoptive 
transfer of B cells with an endogenous 
B cell receptor repertoire to compete with 
wild-type recipient cells. Representative flow 
cytometry plots on live B cells (DAPI-B220*) 
at day 21 after NP-KLH immunization, for 
gating transferred cells (CD45.2) of the 
indicated genotypes within GC B cells. Graph 
represents mice combined from three 
independent experiments, with 9 to 14 mice 
total per genotype and time point. Mean + SD, 
unpaired t test (equal SD, two-tailed). 

s.c., subcutaneously. 


Btg1°°54 induces MYC-associated biosynthetic 
programs in GC B cells 

To gain insight into the competitive advantage 
conferred by mutant Btgl, we next performed 
RNA sequencing (RNA-seq) in Q36H and CRE- 
control GC B cells (fig. S8A). As noted earlier, 
total abundance of Bigi transcripts was simi- 
lar in both groups (fig. S8B). Only ~15% of Btgi 
transcripts carried the Q36H mutation (fig. S8C), 
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which is comparable to the fraction of mutant 
BTGI transcripts (17 to 45%) in BTG12°°"! 
DLBCL patients (7 = 6; fig. S8D). We detected 
no significant differences in gene expression 
profiles using both unsupervised and supervised 
analyses, except for the expected reduction in 
Rosa26 due to the knock-in (fig. $8, E and F). 
Bulk RNA-seq represents a composite of hetero- 
geneous GC B cells in their various transitional 
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states, which could mask effects occurring in 
subpopulations of cells. We therefore per- 
formed gene set enrichment analysis (GSEA) 
using MSigDB Hallmark and Canonical Path- 
way gene set collections (34-36) to determine 
whether we could detect subtler transcriptional 
perturbation in groups of coordinately regulated 
genes. We identified positive enrichment of 94 
signatures [false discovery rate (FDR) < 0.05], 
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most of them related to activation of biosyn- 
thetic pathways, mitochondrial function, and 
MYC target genes (Fig. 3A and table S3). A 
network connectivity analysis revealed that 
MYC targets were among the highest con- 
nected signatures, suggesting a central role for 
MYC (Fig. 3A). 

Myc is induced transiently together with 
mTORC1 activation upon receiving strong Ty; 
cell help during selection in the anatomical- 
ly defined GC light zone (LZ) (3-5). Myc and 
mTORC1 biosynthetic programs trigger GC 
B cell growth, a prerequisite for their subse- 
quent clonal expansion in the GC dark zone 
(DZ). Accordingly, we observed significant en- 
richment for signatures induced by Try cell 
help in LZ-to-DZ recycling GC B cells, as well as 
MYC and mTORCI programs in Q36H versus 
CRE (Fig. 3B and table S4). The same signatures 
were significantly enriched among human 
BTG1 mutant DLBCL patients as compared 
with BTG wild-type DLBCLs in two indepen- 
dent cohorts (excluding BTG2 mutant patients; 
Fig. 3B and fig. S8G). Among ABC-DLBCLs, 
BTGI1 mutant cases also showed higher expres- 
sion of these signatures (fig. SSH). For func- 
tional studies, we generated isogenic human 
DLBCL cells with ectopic and equivalent ex- 
pression of BTG1%™ versus BTG1” (fig. S8, 
I and J). In this model, BTG12?™ expression 
yielded a similar enrichment of LZ-to-DZ re- 
cycling, MYC, and mTORC1 signatures (fig. S8K). 
Furthermore, genes differentially induced [fold 
change (FC) > 1.5; FDR < 0.01; table S4] in 
BTG127" versus BTGI”? DLBCL cells were 
significantly and exclusively enriched for MYC 
and MAX DNA binding motifs (fig. S8L). The 
corresponding BTG1%™ signature was also 
significantly enriched in BTG1 mutant DLBCL 
patients (FDR < 0.001; fig. S8M), indicating 
consistency of BTG1 mutant effects between 
murine GC and human DLBCL cases. In line 
with enhanced biosynthetic rates, Btg1??°" 
GC B and BTG12**" DLBCL cells showed in- 
creased RNA content and cell size (Fig. 3, C 
and D, and fig. S8, N and O). These data sug- 
gested that BTGI mutation enhances biosyn- 
thetic fitness programs normally associated 
with positive selection and Myc/mTORC1 ac- 
tivation during the GC reaction, an effect that 
is maintained in established DLBCLs. 

MyYC-related signatures could be enriched 
because of either higher expression of Myc or 
expansion of Myc-positive (Myc*) GC B cells. 
To address this question, we crossed our Q36H 
mice to the Myc“ reporter line, in which en- 
dogenous Myc protein is fused to green fluo- 
rescent protein (GFP) (37). To better focus our 
analysis on GC B cells, we used the Cy“ line 
(38) to drive Btg!2?™ expression (Fig. 3E). The 
proportion of MycS'?* GC B cells was signif- 
icantly higher in Q36H versus CRE-control 
mice (Fig. 3F), yet Myc°*?* cells in Q36H and 


CRE mice manifested similar Myc” protein 


Mlynarczyk et al., Science 3'79, eabj7412 (2023) 


and Myc transcript levels (fig. S8, P and Q). As 
expected, RNA-seq analysis on Myc“?* versus 
Myc“'?- GC B cells showed strong enrichment 
of the LZ-to-DZ recycling, MYC, and mTORC1 
signatures in both Q36H and CRE (fig. S8R). 
However, comparing Myc* versus Myc"?~ 
cells from Q36H versus CRE mice revealed up- 
regulation of 201 transcripts in Q36H Myc?* 
(Fig. 3G and table S4). These genes were sig- 
nificantly enriched for Myc immediate early 
direct target genes (39), as well as genes acti- 
vated by Ty cell help through cytokine sig- 
naling (Fig. 3H). Thus, the fitness advantage of 
Btgl mutant GC B cells was linked to increased 
proportion of Myc-expressing cells and en- 
hanced activation of early Myc programs, with- 
out alteration of steady-state Myc expression 
levels per cell. 


Q36H impairs BTG1 association with MYC 
and other transcripts involved in 
LZ-to-DZ recycling 
We next explored how BTG12?™ specifically 
influences MYC expression and Ty; cell help- 
related transcriptional programs. The BTG1 
family proteins TOB1 and TOB2, which share 
the conserved N-terminal domain with BTG1, 
were recently shown to interact with RNAs 
(40). We thus performed RNA immunopreci- 
pitation (RIP) assays using V5-tagged BTGI™", 
BTG12*°", or enhanced GFP (EGFP) proteins 
as baits in our DLBCL cells (fig. S9A). As com- 
pared with EGFP negative control, BTGI"” en- 
riched ~3000 unique mRNA species, whereas 
BTG1?**" enriched only ~700 (fig. S9B). This 
effect was largely due to loss of BTG1”’- 
associated mRNAs (fig. S9C). Significantly 
reduced pull-down with BTG1°**"-V5 versus 
BTG1"-V5 was observed for 732 transcripts, 
which notably included the MYC mRNA 
(log2FC > 2, g < 0.05; Fig. 4A and table S5). 
These transcripts were highly enriched for 
LZ-to-DZ recycling and Myc“"?* signatures 
and marginally enriched for the centrocyte 
signature (which includes the small Myc* pop- 
ulation), but not for other GC or post-GC sub- 
populations (Fig. 4B). We further validated 
BTG1“"-MYC mRNA association and reduced 
enrichment with BTG12?™" in independent 
RIP experiments by quantitative polymerase 
chain reaction (qPCR; Fig. 4C). We next tested 
whether BTG1 directly binds RNA by ultra- 
violet (UV) cross-linking and immunoprecipita- 
tion (CLIP) of RNAs interacting with V5-tagged 
BTGI", BTG12*", and EGFP as compared 
with canonical RNA-binding proteins Musashi-2 
(MSI2) and human antigen R (HuR). No RNA 
was enriched with EGFP, and very little RNA 
and no enrichment for RNA-protein complexes 
right above the expected size of V5-tagged 
BTGI"" or BTG12*™ was observed, as opposed 
to MSI2 and HuR (fig. S9D). 

These data suggested that BTGI”” indirectly 
associates with transcripts involved in Ty cell 
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help and MYC programs to attenuate their 
expression, a function that is lost by BTG1 lym- 
phoma mutants. In line with this, expressing 
BTG1" in our isogenic DLBCL cells resulted 
in depletion of the LZ-to-DZ recycling, MYC, 
and mTORC1 signatures (fig. S9E) and reduced 
cellular RNA abundance, as compared with 
vector-control cells (fig. S9F). We categorized 
differentially expressed genes according to 
their behavior in BTGI™” and BTG12**" DLBCL 
cells. Many transcripts were regulated in a sim- 
ilar fashion (fig. S9G). However, genes repressed 
by BTG1” but not BTG1°*" were largely re- 
stricted to the LZ-to-DZ recycling, MYC, and 
mTORC1 signatures, and complementarily, 
genes induced by BTG1™" but not BTG1%?*™ 
were mostly enriched for MYC-repressed and 
anti-mTORC1 signatures (fig. S9G and table S6). 
Hence, wild-type BTG1 normally associates 
with MYC and transcripts relevant to LZ- 
to-DZ recycling GC B cells, an effect that is 
impaired by mutational disruption of its 
N-terminal domain. 


BTG1°3* lowers the threshold for MYC 
protein synthesis 


We wondered whether loss of MYC mRNA 
association upon BTG1 mutation might alter 
MYC protein induction kinetics, perhaps en- 
abling cells to reach high MYC plateau levels 
more rapidly and yielding the observed in- 
creased fraction of MYC* cells. BTGI1 can regu- 
late mRNA stability in T cells (47). However, 
MYC mRNA half-life was comparable in 
BIG1@" and BTGI”? DLBCL cells exposed 
to transcription inhibitor actinomycin D (fig. 
S10A). To investigate MYC protein expression 
kinetics, we placed BTGI@*™ and BTG1”” iso- 
genic DLBCL cells under serum deprivation 
and performed Western blots for MYC and 
BTGI (by means of the V5 tag) at serial time 
points between 15 min and 2 hours after re- 
feeding (fig. SIOB). MYC was induced >1.5-fold 
between 15 and 30 min in BTG12*™" but did 
not change significantly in BTG1”’-expressing 
cells (fig. SIOB). MYC mRNA abundance re- 
mained unchanged under similar conditions 
(fig. S10C). To further investigate the increased 
MYC protein levels, we treated our DLBCL 
cells with cycloheximide (CHX) to inhibit pro- 
tein synthesis, which revealed no effect of 
BTG1%*" on MYC protein stability (fig. S1OD). 
We then performed an MG132 treatment time 
course to inhibit protein degradation. This 
showed an earlier and higher accumulation 
of MYC protein levels in BTG12?™ cells (a 
4.5-fold peak at 2 hours MG132 versus dimethyl 
sulfoxide (DMSO) in BTG1@*"' cells compared 
with a 3.8-fold peak at 4 hours MG132 versus 
DMSO in BTGI™" cells; fig. SIOE). Both BTG1”* 
and BTG1%*" protein levels decreased upon 
reaching high MYC protein levels, consistent 
with reports that Myc can down-regulate Btg1 
(42) (fig. SIOE). Accordingly, Big] mRNA levels 
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Fig. 3. Mutant BTG1 induces MYC-related biosynthetic programs. (A) Gene set 
enrichment analysis (GSEA) network showing canonical and hallmark gene sets 
positively enriched in Q36H versus CRE up-regulated genes [normalized enrichment 
score (NES) = 1.25; FDR < 0.05]. Distance between two enriched gene sets was 
calculated as the Jaccard index (size intersection: size union) using leading-edge 
genes. The circle size represents the degree of connectivity to which each gene set is 
attached to others. (B) GSEAs showing LZ-to-DZ recycling GC B, MYC, and mTORC1 
activation signatures enrichment in Q36H versus CRE mouse GC B cells (left) and in BTG1 
mutant versus WT DLBCL cases in the BC Cancer Agency (BCCA) cohort (right) (23). 
LZ-to-DZ recycling GC B signature is DECP_UPREG [n = 221 mouse genes (left); n = 201 
human orthologs (right)]; MYC signature is SCHUHMACHER_MYC_TARGETS_UP 

[n = 80 mouse orthologs (left); n = 75 genes represented in the BCCA dataset (right)]; 
mTORCI signature is PENG_RAPAMYCIN_RESPONSE_DN [n =242 mouse orthologs 
(left); n = 230 genes represented in the BCCA dataset (right)]. Enrich., enrichment score. 
(C) Graph showing quantification of RNA extracted from 200,000 mouse GC B cells. 
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One experiment, n = 4 and 5 mice per genotype. Mean + SD, unpaired t test (equal SD, 
two-tailed). (D) Representative flow cytometry histograms showing cell size measure- 
ment as forward scatter area (FSC-A) in GC B (FAS*CD88 ) and non-GCB (FAS CD38") 
cells (left). Graph showing geometric mean (gMean) of FSC-A from one experiment, 
n=5 and 6 mice per genotype, representative of at least three independent experiments 
(right). Mean + SD, unpaired t tests (two-tailed). (E) Experimental schematic and 
representative flow cytometry plots showing Myc°'* cells (green) in centroblasts 

(CB) and centrocytes (CC). (F) Bar plot showing proportion of Myc*?* cells in CC. 
One experiment with n = 3 and 4 mice per genotype, representative of at least three 
independent experiments. Mean + SD, unpaired t test (two-tailed). (G) Genes (n = 201) 
depicted with red dots present higher expression (FC > 1.2) in GFPM** versus GFPY- 
GC B cells comparing Q36H mice (y axis) to CRE mice (x axis). (H) Hypergeometric 
mean analysis for the 201 genes identified in (G). The Myc immediate early signature 
consists of Myc-dependent genes induced by 8 hours in Myc WT/flox B cells (39). Genes 
in this and other gene signatures are included in supplementary table S4. 
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Fig. 4. Q36H mutation disrupts BTG1-MYC mRNA association and enhances 
MYC protein synthesis kinetics. (A) Heatmap showing n = 732 BTGI™'- 
associated (BTG1"" RIP/input log2FC > 2, q < 0.05) and BTG12°*"-lost 
(BTG12°4/BTG1"" log2FC > 2, q < 0.05) transcripts by RIP-seq, using V5 
antibody in SU-DHL4 DLBCL cells overexpressing V5-tagged BTG1"" or 
BTG1°°°"" MYC and leading edge genes from positively enriched LZ-to-DZ 
recycling GC B, MYC, and mTORC1 signatures in BTG12°*4 versus BTG1"" are 
labeled. (B) Signatures enriched for up-regulation of the n = 732 BTG1"- 
associated and BTG12°*#-lost transcripts as determined by hypergeometric 
mean analysis. (€) RIP-qPCR for MYC mRNA in indicated SU-DHL4 V5-tagged 
DLBCL cells. Mean + SD of enrichment in RIP over input samples, normalized 
to BTG1'-V5. Three independent experiments, each with n = 2 or 

3 independently generated lines per genotype. Unpaired t test (equal SD, two- 
tailed, versus EGFP or as indicated). (D) Representative MYC flow cytometry 
histograms of SU-DHL4 DLBCL cells expressing V5-tagged BTG1" (WT-V5) or 
BTG124 (Q36H-V5) and treated with 5 uM MG132 for 0 or 2 hours. Proportions 
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of MYC-positive cells (MYC*) are indicated. Staining with isotype control is 
included. Graphs show proportion of MYC* cells (left) or MYC protein levels 
within MYC* cells (right) as measured by flow cytometry in three independent 
experiments, each with n = 2 or 3 independently generated lines per genotype, 
shown as fold change relative to WT-V5 O hour mean value per experiment. 
Mean + SD, two-way analysis of variance (ANOVA) for Q36H-V5 versus WT-V5. 
gMeanFl, geometric mean fluorescence intensity. (E) Polysome profile of 
BTGI™-V5 and BTG12°"-V5 expressing SU-DHL4 cells. Mean (line) + SD 
(shade) from n = 3 independently generated lines per genotype except fractions 
4 to 9: n = 2 Q36H lines owing to technical loss of one line. One experiment 
representative of two independent experiments. (F) Bar plot shows the area 
under the curve (AUC) for indicated portions of the polysomal traces from (E), as 
percent of the total AUC average per genotype. Mean + SD, unpaired t test 
(equal SD, two-tailed). (G) Absolute MYC mRNA levels from fractions in (E) and 
(F), pooled as indicated. n = 2 independently generated lines per genotype. 
Mean + SD, unpaired t test (equal SD, two-tailed). 


were reduced in Myc“"?* versus Myc“"?- GC 
B cells (fig. SIOF). To measure MYC protein 
levels per cell, we performed MYC flow cytom- 
etry analyses in similar MG132 time courses. 
The fraction of MYC" cells was larger (~1.5-fold) 
in BTGI®™ cells than in BTG1™” cells through- 
out the time course and was accompanied by a 
mild and faster up-regulation of MYC protein 
abundance in MYC‘ cells (Fig. 4D and fig. SIA). 
MYC mRNA levels remained comparable be- 
tween BTG12**" and BTGI” cells (fig. SIIB). 
These observations suggest that BTG127°" 
lowered the threshold for MYC protein syn- 
thesis, resulting in a greater fraction of cells 
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expressing MYC. Three additional DLBCL- 
recurrent BTG1 mutants (S43N, A49T, and 
T391) accelerated MYC protein induction in 
a similar fashion (fig. S11C), and this effect 
was also observed when we expressed BTG1?**" 
versus BTG1”” in freshly immortalized pri- 
mary human GC B cells (43) (fig. S11, D to G). 

To directly measure MYC mRNA engage- 
ment with the translational machinery, we 
carried out polysome profiling in our isogenic 
BTG1"" and BTG1®**" DLBCL cells. We con- 
firmed that polysomal fractions were enriched 
for ribosomal proteins (e.g., RPS3; fig. S12A). 
BTG12*" cells showed higher RNA abun- 
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dance in polysomal fractions as compared 
with BTG1"" cells, indicative of increased 
overall translation (Fig. 4E and quantification 
in Fig. 4F). BTG1®*™ cells also contained a sig- 
nificantly greater abundance of MYC tran- 
scripts within polysomal fractions (Fig. 4G and 
fig. S12B). Collectively, these data suggested 
that wild-type BTGI1 association with MYC 
mRNA may restrict MYC translation, an effect 
that is specifically disrupted by BTGI mutations. 
We hypothesized that this might reduce the 
threshold for Ty cell help-dependent MYC 
induction, thereby enhancing the response 
of GC B cells to Try cell help signals. 
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More-rapid S phase completion and DZ 
commitment by Big1°3*" LZ-to-DZ recycling cells 
The degree of Tp; cell help, and therefore Myc 
induction, determines the ability and speed of 
GC B cells to transit through S phase and sub- 
sequently undergo proliferative burst in the 
DZ (44, 45). Hence, we predicted that enhanced 
Myc kinetics due to Big7 mutation would en- 
able GC B cells to more efficiently progress 
through S phase and into the DZ proliferative 
program. To evaluate whether and how such 
effects might manifest in the heterogeneous GC 
milieu, we performed targeted single-cell RNA- 
seq (n = 496 immune-related genes; table S7) on 
competing B1-8"/Btg1 and B1-8"/CREneg 
control GC B cells sorted at day 10 after NP 
immunization, when Q36H cells outcompete 
control cells by ~’70 to 30% (Fig. 5A and fig. S6A). 
A general overview (2982 cells across three 
biological replicates) revealed a similar general 
distribution of individual cell gene expression 
profiles, without bias toward genotype or bio- 
logical replicate (fig. S13, A and B). Individual 
cells were assigned to specific GC B cell sub- 
populations (DZ, LZ, or LZ-to-DZ recycling) 
according to their distinctive signatures and 
marker transcript levels (Fig. 5B). 

LZ-to-DZ recycling cells enter S phase but 
generally do not engage G2/M until transit- 
ing into the DZ and expressing the correspond- 
ing centroblast (CB) transcriptional program 
(44). Accordingly, S and G2/M phase signa- 
tures were mostly confined to the DZ com- 
partment (fig. S13C). To compare cell cycle 
progression of Q36H versus control GC B cells, 
we plotted their frequency according to cell 
cycle program expression, across a pseudo- 
time scale starting from G1 phase (Fig. 5C). 
We found a significant increase in Btg1?°" 
cell density from S phase entry and through- 
out G2/M (P = 0.0071; Fig. 5C). As expected, 
cells expressing the LZ-to-DZ recycling prog- 
ram localized at the Gl-to-S transition and 
extended into G2/M along this pseudotime 
axis (pseudotime units 9 to 14; Fig. 5D). We 
then projected the DZ and G2/M signatures 
onto the same axis. GC B cells at the LZ-to- 
DZ recycling pseudotime transition showed 
earlier induction and greater proportion of 
Btgl?*" cells having initiated DZ (P = 0.0117; 
Fig. 5E) and G2/M (P = 0.0202; Fig. 5F) pro- 
grams. Along these lines, we identified 28 
genes with differential expression in Btg1?™ 
versus control cells, which mainly corresponded 
to changes in the proportion of cells expressing 
these genes (chi-square P < 0.05) and with most 
genes being related to cell cycle and lymphoma 
(fig. S13D). Therefore, Btg1?°°" LZ-to-DZ re- 
cycling GC B cells engaged the proliferative DZ 
program earlier and in greater proportions. 

These single-cell profiles raised the question 
of whether Btg1 mutant GC B cells may com- 
plete S phase more rapidly or at a higher rate. 
To explore this, we measured cell cycle kinetics 
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in competition experiments. To distinguish be- 
tween entry, mid-late, and post S phase cells, 
we injected 5-ethynyl-2'-deoxyuridine (EdU) 
and 5-bromo-2’-deoxyuridine (BrdU) at 1.5 hours 
and 0.5 hours before euthanasia, respectively 
(46) (Fig. 5G). We validated the expected ad- 
vantage of Btg1*" GC B cells under these con- 
ditions, with no change in DZ:LZ polarity nor 
advantage among non-GC B cells (fig. $13, E 
and F). We observed no difference in S phase 
entry (BrdU* only) in either centrocyte (CC) or 
centroblast (CB) populations (Fig. 5G). In contrast, 
there was a significant increase in Btg1®*°" CC 
and CB progressing through mid-late S phase 
(BrdU*EdU*) and in Btg!?°™" CB post S phase 
(EdU* only; Fig. 5G). These observations sug- 
gested that Btg1 mutant GC B cells transited 
and completed S phase more rapidly than con- 
trol cells. To distinguish whether such effects 
were specific to the LZ-to-DZ recycling GC 
B cells, we next performed in vivo Edu incorpo- 
ration assays in our Q36H/Cy1“*/Myc*? mice 
to detect S phase-experienced cells in Myc@"?* 
versus Myc“"?~ GC B populations. To avoid 
losing the small number of Myc“"?* GC B cells 
during EdU staining, sorted cells were fixed 
and stained by immunofluorescence on slides 
(Fig. 5H and fig. S13G). A significantly higher 
proportion of EdU* cells was detected in Q36H 
versus CRE Myc“!?* GC B cells (P = 0.0304; 
Fig. 5H), whereas there was no change in Q36H 
versus CRE Myc“* cells. This suggested that 
Btg1?*" induced an S phase transit advantage 
specifically to LZ-to-DZ recycling GC B cells. 


Competitive advantage conferred by Btg1°%64 


links to faster LZ-DZ kinetics 


Our data so far suggest that Btg7 mutation 
provides a GC fitness advantage by reducing 
the threshold to Ty cell help response, as man- 
ifested by facilitated Myc protein induction, 
DZ program commitment, and S phase com- 
pletion. To determine whether these changes 
could explain the Btg1®**"' fitness advantage, 
we used established in silico mathematical 
models that reflect different aspects of GC 
temporal and clonal selection dynamics, in- 
cluding B cell receptor and CD40 signaling, 
antigen uptake, and Myc and mTORC1 induc- 
tion (47). Comparing multiple different com- 
binatorial perturbations of these signals, we 
found that providing GC B cells with a modest 
increase in the speed of Ty; cell help response, 
corresponding to a 1.12-fold increase in the 
rate of Myc induction and mTORC1 activation, 
recapitulated the Btg!2?“" competitive advan- 
tage observed in vivo (fig. S14A). However, 
under noncompetitive conditions, this model 
predicted higher GC and CB proportions for 
Btg1 mutant (fig. S14A), two features that were 
not observed in vivo (see figs. S4. and S5). Taking 
into account our observation of a more rapid S 
phase completion, we determined that shorten- 
ing the S phase by 21%, in addition to providing 
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1.12-fold faster Tyy cell help response, recapit- 
ulated the competitive advantage of Bte1?°™ 
GC B cells without altering GC volume and 
polarity under noncompetitive conditions, 
more accurately reflecting the Btg1@°™" in vivo 
phenotypes (fig. S14B). These parameters fur- 
ther predicted that Btg1 mutant GC B cells 
would undergo an increased number of GC 
rounds over the course of the GC reaction (P < 
2 x 107° at day 10; Fig. 6A). 

To test this scenario functionally, we synchro- 
nized competing Q36H and WT GC B cells re- 
ceiving targeted Ty; cell help in vivo and tracked 
their LZ-to-DZ kinetics. T cell help can be 
elicited by injecting an OVA-conjugated anti- 
DEC205 antibody (aDEC205-OVA), which binds 
to the B cell surface receptor DEC205 and 
delivers OVA to the major histocompatibility 
complex class II (MHC-II) presentation path- 
way, inducing strong interactions with OVA- 
specific Try cells (44) (Fig. 6B). We adoptively 
transferred 3% Q36H (B1-8""*/-; DEC205*/*; 
R26! BSIQ6H/+. Cq7gTe/*, CD45.1) and 7% WT 
(fluorescent B1-8"!*/-; DEC205*/*; CD45.1) rest- 
ing B cells, to reach a 50:50 proportion of Q36H 
and WT cells at 6 to 7 days into the GC reaction. 
The remaining 90% were B1-8"!*/-; DEC2057"~; 
CD45.1/2 cells that lack the DEC205 receptor 
and do not receive targeted Ty cell help, to 
better reflect physiological conditions where 
Tru cell help is only available to a small fraction 
of GC B cells (Fig. 6C). In this three-way com- 
petition system, the different transferred cell 
populations were distinguished using a com- 
bination of congenic and fluorescent markers. 
Recipient CD45.2 mice were primed with OVA 
before the B cell transfer to generate OVA- 
specific Ty, cells. The day after transfer, mice 
were boosted with NP-OVA to recruit OVA- 
specific recipient Ty cells and NP-specific 
B1-8" transferred B cells into the GC reaction. 

We delivered Tyy cell help to competing 
DEC205*/* Q36H and WT cells 6 to 7 days into 
the GC reaction by injecting aDEC-OVA. We 
then measured the proportion of GC B cells in 
LZ (CC) and DZ (CB) over time (0, 36, 48, and 
72 hours) after Ty cell help delivery (Fig. 6, 
C and D). In line with previous work (44), 
DEC2057/~ cells remained enriched in the LZ 
over this time course, showing a stable DZ:LZ 
ratio, and did not expand after anti-DEC205- 
OVA injection, whereas DEC205*”* cells moved 
to the DZ (as CB) and expanded between 48 
and 72 hours (Fig. 6, D and E, and fig. S14, C 
and D). Within DEC205*/* cells, Ty; cell help 
induced a more robust increase in DZ:LZ ratio 
for Q36H GC B cells as compared with WT, 
peaking at 48 hours (P = 0.0187; Fig. 6E and 
fig. S14). These results reflected a greater DZ 
reentry for Btg1?°*" GC B cells upon receiv- 
ing Try Cell help signals. Collectively, these 
data suggested that subtle acceleration of Try 
cell help-related Myc induction and S phase 
completion led to a progressive and marked 
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Fig. 5. Faster S phase completion and DZ program commitment in 
Btg12°*4 | 7-to-DZ recycling cells. (A) Experimental design for targeted single- 
cell RNA sequencing for n = 496 genes in competing Q36H and CREneg GC 

B cells from n = 4 mice pooled into three biological replicates. (B) Dark zone 
(DZ), light zone (LZ), and LZ-to-DZ recycling GC B cells were defined on the 
basis of signatures enrichment and of Mki67 and Cd86 marker expression 
levels, projected onto the uniform manifold approximation 
(UMAP) distribution of cells (n = 2982). Centroblast (CB) 
signatures correspond to genes down-regulated or up-reg 
p-regulated in Myc 
versus Myc*'® CRE control GC B cells from Fig. 3E; mTORC1 signature is 
PENG_RAPAMYCIN_RESPONSE_DN; DECP signature is LZ_DECP_upreg and Myc 
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greater DZ reentry potential and an opportu- 
nity to undergo increased GC repetitive rounds 


Representative IF images are shown 
independent experiments, each with n 
represents one individual field of view 


. Bar plot shows quantification from two 
= 2 or 5 mice per genotype. Each dot 
with a minimum five cells per field of view. 


s by Wilcoxo' 


n test. 


Mean + SD, unpaired t test (two-tailed). 


(Fig. 6F). 
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Btg1°3° expression induces highly aggressive 
and rapidly progressive lymphomas 

We next explored whether Btg/°™" is a bona 
fide lymphoma oncogene. Given that MCD/ 
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cluster 5 DLBCLs derive from B cells having 
transited the GC reaction, we induced Btgi?*°™ 
expression using Cy“. MCD-DLBCLs typically 
express high BCL2 levels (J). We crossed our 
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Fig. 6. Accelerated LZ-DZ recycling kinetics in Btg1°°°" GC B cells. (A) In 
silico readout for the number of LZ-DZ cycles undergone by competing Btgl mutant 
versus WT GC B cells at day 10 after immunization, using the GC mathematical 
model in which mutant GC B cells are provided with 1.12 times faster T cell help 
response (faster up-regulation of Myc and mTORC1) and 21% shorter S phase. 
Each dot is a readout from a single simulation (n = 300). Difference by Wilcoxon 
test considering the mutant and WT readout from the same simulation as paired 
samples. (B) Schematic of the system used to deliver targeted Try cell help 


to competing GC B cells in vivo. (C) Experimental desig 


mice to the VavP-Bcl2 lymphoma mouse model 
(48, 49) to reflect this biology. We generated 
cohorts of Bcl2, Bcl2+Q36H, Q36H, CREneg, 
and CRE mice through bone marrow hemato- 
poietic stem cell transplantation into syngeneic 
wild-type recipients (Fig. 7A). Bcl2+Q36H ani- 
mals displayed shorter survival than did Bcl2 
animals (P = 0.0005; Fig. 7B). Notably, survival 
outcomes were indistinguishable between 
Q36H-only mice and CREneg or CRE con- 
trol groups (P > 0.47; Fig. 7B). As opposed to 
Bcl2+Q36H, Bcl2+WTKI animals did not ex- 
perience more-rapid mortality rates than Bcl2 
animals, in a similar transplantation cohort 
using fetal liver hematopoietic stem cells (fig. 
S15, A to C). 

To determine whether mutant Btgl accele- 
rated disease onset, we analyzed a subset of 
seemingly healthy Bcl2+Q36H and Bcl2 ani- 
mals (n = 5 per group) at 8 months after 
transplantation. Bcl2+Q36H mice manifested 
increased splenomegaly (P = 0.0257; Fig. 7C 
and fig. S15D). Histologically, Bcl2 spleens 
showed a prominent follicular pattern but 
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n for the targeted delivery 


otherwise largely intact architecture, whereas 
Bcl2+Q36H animals manifested distortion and 
expansion of B cell follicles (B220*) and GC 
areas (PNA*; Fig. 7D). GC B cells (FAS*CD38°) 
were increased in proportion to B cells (B220*; 
fig. SI5E), were larger (fig. SI5F), contained a 
higher fraction of mTORC1-activated cells 
(phosphorylated S6 (pS6*); fig. SI5G], and 
were more clonal (IgVH PCR on cDNA; fig. 
S15H) in Bcl2+Q36H versus Bcl2 spleens. Un- 
supervised analysis of RNA-seq performed in 
these lymphoma cells showed markedly dis- 
tinct transcriptional profiles (fig. S151). Mutant 
BTG1-induced signatures such as positively se- 
lected GC B cells and MYC targets were sig- 
nificantly enriched (FDR < 0.001 and 0.005, 
respectively), suggesting maintenance of BTG1 
mutant effects (fig. S15J). Lymph node archi- 
tecture was severely disrupted in Bcl2+Q36H 
mice, with expansion of extrafollicular areas 
containing enlarged and proliferative (Ki67*) 
lymphoma cells, as compared with Bcl2 (Fig. 
7E). Most notably, Bcl2+Q36H mice displayed 
invasion of malignant B cells into extranodal 
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of Try cell help to competing DEC205*”* WT and Q36H GC B cells in vivo and tracking 
their centroblast (CB) versus centrocyte (CC) identity over time. (D) Representative 
gating flow cytometry plots for C. (E) Graph shows pooled data for (C) and (D), 
from two experiments, each with n = 3 to 5 mice per time point. Paired t test, 
two-tailed. (F) Schematic model. More-rapid GC LZ-DZ cycles in Btgl mutant 

(MUT) GC B cells, because of faster response to Try cell help via accelerated Myc 
protein induction, more rapid S phase completion and earlier commitment to DZ 
transcriptional program, explain the progressive competitive fitness gain of Btg 
GC B cells over the course of the GC reaction. Rec, LZ-to-DZ recycling. 


1036H 


tissues such as the liver, kidney, and lung, 
whereas Bcl2 mice only manifested minor peri- 
vascular infiltrates in these tissues (Fig. 7F and 
fig. S15K). 

To assess Btg!® effect on the genetic tra- 
jectory during lymphomagenesis, we performed 
B cell receptor (BCR) sequencing (fig. S16, A 
and B). All lymphomas harbored somatic hy- 
permutation activity, typical of their GC origin 
(fig. S16C). Bcl2+Q36H lymphoma B cells man- 
ifested significantly greater mutational burden 
(fig. SI6D) and clonal expansion when com- 
pared with Bcl2, as represented by their lower 
Simpson’s index and expanded phylogenetic 
trees (Fig. 7G and fig. S16, E and F). Further- 
more, Bcl2+Q36H clonal lineages presented 
lower estimated replacement-to-silent muta- 
tion rate ratios (dN/dS) than Bel2, which is 
indicative of purifying selection and consistent 
with more cycles of division and mutation in 
germinal centers (50) (fig. S16G). In line with 
this, mathematical modeling (fig. S14B) pre- 
dicted a higher number of divisions in selected 
mutant versus wild-type GC B cells over the 
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Fig. 7. Mutant BTG1 drives formation of highly aggressive B cell lymphomas 
in mice and humans. (A) Experimental design for testing contribution of the 
R26! Btsl036H/* allele to Bcl2-driven lymphomagenesis. (B) Kaplan-Meier curves 
depicting overall survival of groups described in (A) in days after transplantation, 
assessed by either time of death or euthanasia upon sickness development. 
Log-rank (Mantel-Cox) test performed as indicated. (C) Image of n = 5 spleens 
per group at 8 months after transplantation. (D) Consecutive spleen sections from 
Bcl2 and Bcl2+Q36H mice at 8 months after transplantation stained with H&E or by 
immunohistochemistry (IHC) for B cells (B220 antibody) or GC B cells (PNA). 
Representative of n = 5 mice per genotype. (E) Consecutive lymph node sections 
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from Bcl2 and Bcl2+Q36H mice at 8 months after transplantation stained with 


H&E or by IHC for proliferating cells (Ki67). Representative of n = 5 mice per genotype. 


(F) Consecutive liver sections from Bcl2 and Bcl2+Q36H mice at 8 months after 
transplantation stained with H&E or by IHC for B cells (B220). Representative 

of n = 4 mice per genotype. (G) B cell lineage trees from BCR sequences analysis 
(ImmunoSeq) in Bcl2 and Bcl2+Q36H mice at 8 months after transplantation. 
The B cell clone with the most sequence reads for each mouse (mouse number 
indicated in the bottom) is shown. Branch lengths represent the estimated number 
of somatic hypermutations between nodes. Scale bar denotes 10 mutational events. 
Trees from n = 2 mice, representative of n = 5 mice per genotype (fig. S16F), are 
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shown. (H) H&E staining on indicated tissue sections from Bcl2 and Bcl2+Q36H 
moribund mice. Representative of n = 6 mice per genotype. Low-grade follicular 
lymphoma-like centrocytes shown in Bcl2 mice. In Bcl2+Q36H mice, white 
arrowheads depict large immunoblastic cells; yellow, plasmacytoid cells; and orange, 
a large-size DLBCL-like cell. (1) Kaplan-Meier curves depicting overall survival of 


course of the GC reaction (fig. SIGH). Collect- 
ively, lymphoid architecture disruption and 
invasion of extralymphoid tissues by more 
proliferative, dysplastic, highly mutated, and 
clonally expanded tumor cells indicated more 
advanced and phenotypically malignant dis- 
ease in Bcl2+Q36H mice, with the genetics 
indicating that Btg1?°°" B cells underwent 
more rounds of selection and somatic hyper- 
mutation, in line with their fitness advantage. 

Further along these lines, moribund Bcl2 mice 
presented with low-grade follicular centrocyte- 
like lymphoma patterns, whereas moribund 
Bcl2+Q36H mice featured sheets of larger 
lymphoid cells, reminiscent of DLBCL with 
many cells showing immunoblastic or plasma- 
cytoid appearance (Fig. 7H). These differential 
features were observed in the spleen, as well 
as extranodal tissues (Fig. 7H). Bcl2+Q36H 
mice also uniquely manifested cases of ex- 
treme disease invasiveness into peripheral 
organs (fig. S161, underlining the impact of 
Btg!?*" in driving phenotypically high-grade 
aggressive lymphomas, with features similar 
to human cases. 

Consistent with these aggressive pheno- 
types, we observed inferior clinical outcomes 
for BTGI mutant patients within ABC-DLBCLs 
from publicly available cohorts (9, 23, 24) (P = 
0.0011; Fig. 71). Univariable Cox regression 
further showed significant association between 
BTGI mutational status and inferior overall 
survival among ABC-DLBCLs (P = 0.0013; Fig. 
7J and fig. S16J). Notably, a multivariable Cox 
regression analysis, including DLBCL genetic 
subtype (LymphGen), gender, International 
Prognostic Index (IPI), and BTG1 mutation 
status as covariates, showed that BTGI muta- 
tion remained significantly associated with 
overall survival (P = 0.0190; Fig. 7J and fig. 
S16J). Collectively, these data indicated that 
BTG1 mutation presents potent oncogenic 
properties. Consistent with its role in providing 
supercompetitive fitness to GC B cells, mutant 
BTGI conferred an aggressive and tissue-invasive 
phenotype to B cell lymphomas, resulting in ex- 
tranodal spread and inferior clinical outcomes. 


Discussion 


Multicellular life requires individual cells to 
sacrifice competitive fitness to ensure homeo- 
stasis and survival. Mechanisms that enable 
“altruistic” cooperation between cells include 
restraining cell proliferation and preventing 
acquisition of somatic mutations among other 
functions and are often viewed as weakened in 
cancer (57). However, certain nonaltruistic, com- 
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petitive processes take place in normal em- 
bryonic development (52). The archetypal ex- 
ample of supercompetition was first described 
during Drosophila development, whereby cells 
with greater Myc expression outgrow and may 
also cull neighbor cells with slightly lower Myc 
levels (53, 54). The GC reaction provides an 
intriguing microcosm of the cancer “return to 
unicellularity” concept in normal nonembryonic 
physiology, whereby restraints on prolifera- 
tion and somatic mutations are attenuated 
and a Myc-dependent competitive process of 
clonal diversification and selection takes place. 
Critical to keeping this process under control, 
expression of Myc is permitted only tran- 
siently in GC B cells, together with mTORC1 
activation, to provide time-limited biosynthe- 
tic capabilities (5). 

The absolute arbiter of Myc induction in GC 
B cells is the intensity of Ty cell help signals 
received on the basis of affinity for cognate 
antigen. The resulting range of Myc “doses” can 
determine initial S phase duration and how 
many divisions a selected GC B cell undergoes 
(45, 55). Our findings suggest that in addition 
to Myc dosage, the rate of Myc protein induc- 
tion may influence GC B cell fitness, because it 
is associated with earlier S phase completion, 
greater DZ reentry, and earlier activation of 
proliferative programs. Naive B cell activation 
models also suggest a critical function for Myc 
induction kinetics in determining B cell ex- 
pansion and cell fate (56). However, the com- 
petitive gain conferred by mutant Btgl was 
restricted to the GC B cell compartment. Thus, 
analogous to embryonic supercompetition, GC 
B cells with slightly superior Myc induction 
kinetics have the potential to outcompete 
and dominate the GC reaction through clonal 
expansion. 

Given these scenarios, it is plausible that 
B cells would have evolved ways to control 
immune signaling thresholds for Myc induc- 
tion, especially because GC B cells are exposed 
to an environment rich in growth-promoting 
cytokines and ligands. Our data suggest that 
wild-type BTG1-mediated restriction of MYC 
translation may represent one such mecha- 
nism, acting as a safeguard or gatekeeper 
against unrestrained “unicellular-like” fitness. 
Mutant BTGI1 released MYC from this restric- 
tive effect, thus converting GC B cells into 
supercompetitors that transformed into ex- 
tremely aggressive lymphoma cells. Moreover, 
Btgl mutant cells aberrantly retained MYC 
signatures while activating DZ and G2/M 
programs, suggesting that biosynthetic growth 
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ABC-DLBCL patients combined from publicly available cohorts (9, 23, 24), based on 
their BTG1 status. Log-rank test was performed. Censored events are indicated 
below the graph. (J) Univariate and multivariate analyses in ABC-DLBCL cases from 
the same combined cohorts as in (I). HR, hazard ratio with 95% confidence interval. 
P value from Wald chi-square test for HO:HR = 1. 


and proliferation abnormally coexisted rather 
than being strictly separated. Future studies 
will determine whether this supercompeti- 
tion effect is further associated with eliminat- 
ing wild-type cells, perhaps depriving them 
from positive selection signals or resource utili- 
zation in their microenvironment. The impact 
of BTGI mutation on the magnitude of Try 
cell help response and MYC induction kinetics 
was quite subtle, implying that competing 
GC B cells operate at the most extreme limits 
of multicellular homeostasis, a small pertur- 
bation of which is sufficient to tip the balance 
toward malignancy. The iterative nature of the 
GC reaction may further amplify this subtle 
effect over time. These findings suggest a del- 
icate evolutionary trade-off between defend- 
ing vertebrate organisms from infections versus 
facilitating certain “atavistic” features in B cells 
that recall the more intense and dynamic fea- 
tures of unicellular life. 

In T cells, BTG1 was reported to maintain 
quiescence through global mRNA degrada- 
tion (47) and interaction with CCR4-NOT 
(5, 20, 57), a deadenylase complex regulating 
mRNA stability. In HEK293T cells, certain 
BTGI1 lymphoma mutants manifested impaired 
interaction with CCR4-NOT (20). However, 
mutant BTGI did not affect MYC mRNA levels 
or mRNA stability in GC or DLBCL cells. The 
BTG family member TOB1 potentially binds 
mRNAs through 3’ untranslated region (3’UTR), 
5'UTR, and coding sequences, suggesting func- 
tions beyond mRNA stability regulation (40). 
However, BTG1 lacks TOB1 C-terminal domain 
or any known RNA binding domain and in- 
stead associated with RNAs indirectly. Given 
that BTG1 mutants facilitated translation of 
MYC and potentially other proteins, we hy- 
pothesize that wild-type BTG1 association 
with MYC mRNAs restricts MYC translation, 
until reaching a critical Try cell help strength 
threshold, whereas mutant BTG1 lowers this 
threshold through impaired association with 
MYCmRNASs. These effects are likely mediated 
through interaction with as of yet unknown 
RNA binding proteins that might be destabi- 
lized by the altered protein conformation in- 
duced by mutations such as Q36H (37). This 
would confer a partial dominant-negative ef- 
fect, whereby mutant BTG1 could occupy wild- 
type BTGI space, preventing its ability to form 
functional RNA regulatory complexes while 
preserving other BTG1 functions. The domi- 
nant effect of these mutations is further under- 
lined by the relatively low fraction of BTGI 
mutant transcripts in primary human DLBCLs 


11 of 23 


RESEARCH | RESEARCH ARTICLE 


and our animal models. Although B7G1 is de- 
scribed as a putative tumor suppressor gene in 
several cancer types (58), it behaved instead as 
a proto-oncogene in DLBCL. This is indicated 
by its functional gain-of-fitness effects and 
focal mutational pattern, but only rare trun- 
cating mutations or loss of heterozygosity. 
Other genes can act both as tumor suppres- 
sors and proto-oncogenes depending on the 
cellular context (59, 60). For example, the poly- 
comb protein EZH2 is a tumor suppressor 
deleted in leukemia, as is BTG1 (61), but is an 
oncogene with gain-of-function mutations in 
B cell lymphoma (6). 

Myc can suppress Btg1 expression through 
induction of mzR-17-92 (42), suggesting a pos- 
itive feedback mechanism. Indeed, we ob- 
served that BTG1 expression was reduced in 
Myc-positive GC B cells or upon induction of 
MYC in DLBCL cells, and mzR-17-92 deficiency 
impairs GC responses (62). In line with antag- 
onistic effects, MYC and BTG] mRNA levels 
are inversely correlated during cell cycle pro- 
gression in stimulated peripheral blood lympho- 
cytes and NIH3T3 cells (63). However, there 
may be additional mechanisms through which 
MYC can repress BTGI1 expression, given that 
in our DLBCL cells, V5-tagged BTG1 lacked the 
3'UTR binding sites for miR-17-92. BigI tran- 
script levels are evidently tightly controlled in 
GC B cells because expression of Btg1@°" or 
Btg1”" from the Rosa26 locus did not affect 
overall Big] transcript abundance. 

Finally, the lymphomagenic potential of 
mutant BTG1-induced fitness was underlined 
by its powerful cooperative effect with BCL2 
overexpression, both of which are associated 
with the MCD-DLBCL subtype. The compe- 
titive advantage conferred by mutant BTG1 
might be especially deleterious in cells with 
constitutive Toll-like receptor and B cell recep- 
tor signaling due to MYD88 and CD79B mu- 
tations. Alternatively, the aberrant memory and 
clonal precursor cells induced by MYD88 and 
TBL1IXRI1 mutations (12, 13, 64) could gain po- 
tential fitness advantage for reentry into subse- 
quent GC reactions. Finally, the supercompetitor 
phenotype conferred by BTGI mutations seems 
to manifest clinically, because BTG] mutation 
independently associated with inferior outcome 
among ABC-DLBCLs. It is thus warranted for 
future studies to explore potential therapeutic 
vulnerabilities that could be exploited for treat- 
ment of these lethal tumors. 


Potential limitations 

The lymphomagenic potential of Btel@°*" or 
Btg1"”" was evaluated in separate experiments 
without affecting our conclusions, because 
they were evaluated in comparison to their 
respective positive and negative controls in 
each experiment (Fig. 7B and fig. SI5B) in com- 
parable experimental setups (fig. S15C). In 
some mouse experiments, we assessed mutant 
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Btg12*" effects using animals that did not 
carry the Cdi9™ allele. This is not expected to 
be an issue for the following reasons: (i) We 
observed a gain-of-function phenotype (.e., 
gain of competitiveness) rather than a loss-of- 
function effect, and Cre-associated toxicity 
would, on the contrary, impair GC B cell com- 
petitiveness through cell-killing effects (65). 
Gi) Cre knock-in at the Cd79 locus results in 
only one functional Cd/9 allele (32). Given its 
function as a BCR co-receptor, Cdi9“° knock- 
in would be expected to reduce B cell activa- 
tion and competitiveness. (iii) Throughout the 
manuscript, and using multiple orthogonal 
approaches, Q36H was shown to provide a 
similar competitive advantage to GC B cells 
against any tested control, including Cre- 
positive and Cre-negative cells (e.g., Fig. 2, A 
to C, and figs. S6, A and C, S7, A and B, and 
S13E). 


Materials and methods 
Mouse models 


Animal care was in strict compliance with in- 
stitutional guidelines established by Weill 
Cornell Medicine, Guide for the Care and Use 
of Laboratory Animals (66), and the Associa- 
tion for Assessment and Accreditation of Lab- 
oratory Animal Care International. All mouse 
procedures were approved by the Research 
Animal Resource Center and Institutional Ani- 
mal Care and Use Committee of Weill Cornell 
Medicine (protocol #2011-0031). All mouse ex- 
periments were conducted using age- and sex- 
matched animals 8 to 14 weeks old, unless 
specified otherwise. All experiments included 
male and female animals in all groups, ex- 
cept for adoptive transfers, with all recipient 
mice being males. No sex-based influence or 
bias was detected in the observations made 
in this work. 

Conditional knock-in Btg mice 
(R265! Bts1Q36H) Were generated by inGenious 
Targeting Laboratory Inc. (Ronkonkoma, NY, 
USA) via insertion of a stop cassette in intron 1 of 
the Rosa26 locus followed immediately by a 
mutated murine Bigl?™ cDNA-T2A-Luciferase 
sequence (fig. S2A). Murine Big? and human 
BTGI cDNA sequences are slightly different, 
but their protein sequence is 100% identical. 
The targeting vector was constructed as fol- 
lows: The murine Big" cDNA-T2A-Luciferase 
sequence was first cloned into the Mlul site of 
a pSa-stop-bGH polyA vector to generate the 
construct pSa-stop-Bigl?™" cDNA-T2A-Luciferase- 
bGH polyA. The stop cassette was LoxP-PGK/ 
gb2 promoter-neo/kana resistance gene-PGK 
polyA-2X SV40 polyA signal-LoxP. The final 
cassette therefore consisted of a splice accep- 
tor, a LoxP-flanked stop cassette, Big/?°°™ 
cDNA-T2A-Luciferase, and a bGH polyA sig- 
nal. Then, the entire cassette was inserted into 
the Xbal site (XbaI was disrupted by this in- 
sertion) of the pROSA26-1 vector using the 
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Recombineering technology. The resulting 
targeting vector contained a short homology 
arm with a 1.08-kb Rosa26 genomic sequence 
upstream of the cassette and a 4.34-kb-long 
homology arm downstream of the cassette. 
The targeting vector was confirmed by restric- 
tion analysis and sequencing after each mod- 
ification. iTL IC1 (C57BL/6) embryonic stem 
cells were electroporated with this targeting 
vector, stable G418-resistant clones were de- 
rived and microinjected into BALB/c blastocysts. 
Resulting chimeras with a high-percentage 
black coat color were mated to wild-type 
C57BL/6N mice to generate F1 heterozygous 
offspring. Tail DNA was analyzed from pups 
with black coat color. Short homology arm 
integration, long arm junction, and targeted 
allele sequence were confirmed by PCR fol- 
lowed by gel electrophoresis and Sanger se- 
quencing (Genewiz). 

Conditional Btg1”™ mice (R26"'B2") were 
generated at the MSKCC Mouse Genetics Core 
Facility by introducing a single-nucleotide sub- 
stitution to revert the Q36H mutation of the 
cDNA sequence in R26"! ®8!9368 mice to a 
wild-type allele, by CRISPR-Cas9-mediated 
genome targeting (fig. S2, D and E). Oocytes 
from 3- to 5-week-old R26)Bts1Q36H/1s).Btg1Q36H 
or R265! Bts1Q36H/+ supnerovulated females and 
sperm from >2-month-old R26" BE! oH sBiElQ3eH 
males were used for in vitro fertilization. Re- 
sulting zygotes were injected with a mix of 
Cas9 protein (Integrated DNA Technologies), 
CRISPR RNA (crRNA, 5’-GGCTCTGGCTGAA- 
AGTCTGC-3’), trans-activating CRISPR RNA 
(tracrRNA, 5’-AAACAGCAUAGCAAGUUA- 
AAAUAAGGCUAGUCCGUUAUCAACUUGAA- 
AAAGUGGCACCGAGUCGGUGCU-3’), and a 
105-base pair (bp) donor oligo for homologous 
recombination (HR oligo, 5’-ATCTCCAAGTTC- 
CTCCGCACCAAGGGGCTCACGAGCGAGCGA- 
CAGCTGCAGACTTTCAGCCAGAGCCTGCAG- 
GAGCTGCTGGCAGAACATTACAAACATCAC- 
TGG-3'), which contained a C>G substitution 
(in bold in the above sequence) to revert H36 
(CAC) to Q36 (CAG). The mutated C in Bigl?™ 
was also the third base of the protospacer ad- 
jacent motif (PAM) sequence on the reverse 
strand and therefore did not exist in the endo- 
genous Bigi sequence or in the Rosa26 wild- 
type reverted R26'S!-3'st allele, preventing 
Cas9 recutting of the reverted allele. The C>G 
substitution also created a new PvulI site, 
which we used to genotype the pups. After 
injection, two-cell embryos were implanted 
into day 0.5 pseudo-pregnant females. Pups 
were then checked by PCR on genomic DNA 
and Pvull digestion (fig. $2, E and F). Po- 
tential founders were further checked by Sanger 
sequencing (Genewiz). 

The following strains were obtained from 
The Jackson Laboratory (Ben Harbor): C57BL/ 
6J (CD45.2, stock 000664), B6.SJL-Ptprc*Pepc?/ 
BoyJ (CD45.1, SJL*”*, stock 002014), Cd19“° 
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(32) (stock 006785), CyI“° (38) (stock 010611), 
Myc“ (37) (stock 021935), B1-8" (33) (stock 
007594), DEC2057'~ (67) (stock 005528), PA-GEP 
(44) (stock 022486), and mRFPI (68) (stock 
005884). The VavP-Bcl2 (48) model was devel- 
oped by J. M. Adams (Walter and Eliza Hall 
Institute of Medical Research, Australia). 


Genotyping 


PCR-based genotyping of the R26!s!Bs1Q36H 
and R26!5|-Btsit alleles was performed on 
mouse tail lysates. Tail tips were lysed in 
DirectPCR lysis reagent (Viagen 102T) plus 
proteinase K during overnight incubation 
at 55°C with shaking, followed by 45 min 
incubation at 85°C (no shaking) to inactivate 
proteinase K. After a short spin, supernatant 
was used for PCR using Taq polymerase 
(Roche). The forward primer annealed to the 
Rosa26 short homology arm upstream of the 
cassette and was combined with two reverse 
primers. The first one annealed to the 5'LoxP 
site of the targeted allele, and the second one 
annealed to the Rosa26 long homology arm 
downstream of the cassette and amplified 
only the nontargeted allele (because of a >5 kb 
distance with the forward primer in the tar- 
geted allele). Details about primers are in- 
cluded in table S8. The PCR program was 94°C 
for 2 min, [94°C for 30 s, 65°C for 30 s, and 
72°C for 1 min] for 40 cycles, and 72°C for 
7 min. Genotyping for R26!S!-3'81936H and 
R265! bts" alleles, as well as all other mouse 
alleles used in this study, was also done by 
Transnetyx using reverse transcription quan- 
titative polymerase chain reaction (RT-qPCR). 


RNA extraction 


Cells were collected from culture and resus- 
pended in TRIzol (Invitrogen 15596018), or 
cells were sorted directly into TRIzol LS 
(one third more concentrated TRIzol reagent, 
Invitrogen 10296028), and total RNA was 
extracted. Purified RNA was resuspended in 
molecular-grade RNase-free water and ana- 
lyzed on Nanodrop and Qubit Fluorometric 
Quantification for quality check and concen- 
tration measurement, respectively. 


qPCR 


cDNA synthesis was performed with equal 
amounts of total RNA for all samples in each 
experiment, using the Verso cDNA Synthesis 
kit (Thermo Fisher AB1453B). RT-qPCR was 
performed using same volume of cDNA for 
all samples, on a QuantStudio6 Flex Real-Time 
PCR System (Thermo Fisher) with Fast SYBR 
Green Master Mix (Thermo Fisher 4385614). 
Each biological replicate (either independent- 
ly generated DLBCL lines or individual mouse 
samples) was further analyzed in triplicate 
wells to assess technical variability, of which 
only the mean was used to calculate the mean 
+SD between biological replicates. mRNA ex- 
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pression levels of genes of interests were 
normalized to that of TBP and/or GAPDH, 
as specified, using the delta-delta Ct method. 
Details about primers used are included in 
table S8. 


Presence of Cre-recombined alleles 


Cre-mediated recombination of R26! 581036 
and R26" alleles was validated on gDNA 
extracted from sorted naive B cells and ger- 
minal center B cells or from B220*-enriched 
B cells as indicated, using Puregene Gentra 
cell kit (QLAGEN 158388). B220* B cells were 
enriched by positive selection with anti-B220 
magnetic beads (Miltenyi Biotec 130-049-501). 
We performed PCR on gDNA using primers 
that specifically amplified the recombined 
targeted allele and Phusion HF polymerase 
(NEB M0530). The PCR program was 98°C 
for 30 s, [98°C for 15 s, 70°C for 30 s, and 72°C 
for 30 s] for 30 cycles, and 72°C for 10 min. 
PCR products were resolved by agarose gel 
electrophoresis and visualized on a ChemiDoc 
Touch imaging system (BioRad) using SYBR 
Safe DNA stain (Thermo Fisher S33102). The 
same primers were used to confirm expression 
of Cre-recombined alleles on total RNA ex- 
tracted from B220*-enriched B cells, reverse 
transcribed, and analyzed by qRT-PCR. For- 
ward primer annealed to the Rosa26 short 
homology arm upstream of the cassette and 
reverse primer to the kozak sequence and 
BtgI coding sequence from the R26"! B81 
or R26"! et at the Rosa26 locus (fig. $2, A 
and D), to ensure only recombined targeted 
alleles and transcripts were amplified from 
gDNA and total RNA, respectively. Primer de- 
tails are included in table S8. 


Efficiency of Cre-recombination 


The proportion of recombined R26"! 22!236H 
allele was measured in the competitive setup 
using adoptive transfer of B cells with an en- 
dogenous BCR repertoire into WT recipient 
mice. We sorted Q36H naive B cells (NB) as 
live B220*IgD*CD45.2 and germinal center 
B cells (GC B) as live B220*FAS*CD38°CD45.2 
from the spleens of CD45.1 recipient mice at 
day 21 after immunization, when Q36H GC 
B cells are outcompeting WT recipient cells. 
gDNA was extracted using Puregene Gentra 
cell kit (QIAGEN 158388) with volumes 
adapted for low material. For each sample, a 
minimum of 240 pg gDNA per well, as mea- 
sured by Qubit dsDNA HS (Invitrogen Q32854), 
was used for qPCR amplification using two 
sets of primers, the first one specifically am- 
plifying the recombined alleles (same as above 
for PCR on gDNA or for qPCR on cDNA) and 
the second one specifically amplifying the non- 
recombined alleles, sharing the forward primer 
with the first set, but with the reverse primer 
annealing to the STOP cassette (fig. S7D). 
Primer details are included in table S8. A stan- 
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dard curve for each of these primer sets was 
generated, using their corresponding double- 
stranded DNA (dsDNA) amplicons to the 
amount of 10? to 10° absolute copy number 
per well. The efficiency of Cre-mediated re- 
combination was calculated as the absolute 
copy number of recombined alleles over the 
absolute copy number of recombined plus non- 
recombined alleles. 


IgH rearrangement 


For clonal variance analysis in the lymphoma- 
genesis study cohort, GC B cells from Bcl2-only 
and Bcl2+Q36H mice were sorted as live 
B220*FAS*GL7* cells from PNA* enriched 
splenocytes on BD Influx or BD FACSAria II, 
8 months after transplantation. We extracted 
RNA and synthesized cDNA using equal 
amounts of RNA for all samples, as described 
above. In parallel, we processed centroblasts 
sorted from a wild-type mouse that had been 
immunized 9 days before with 2% SRBC, to 
use as anontumor control. We then performed 
PCR on equal volume of cDNA for all samples 
with GoTaq Flexi DNA Polymerase (Promega 
M8295) to amplify the VHQ52-JH4 and 
VH7183-JH4 regions of the /gH locus, using a 
set of forward primers that annealed to the 
framework region of one of the most abun- 
dantly used IgVH gene families and a reverse 
primer annealing to the JH1-4 gene segments, 
as described before (69). Each sample was am- 
plified in duplicates, both of which were 
resolved by agarose gel electrophoresis on 
adjacent lanes and visualized on a ChemiDoc 
Touch imaging system using SYBR Safe DNA 
stain. The PCR program was 95°C for 2 min, 
[95°C for 45 s, 60°C for 45 s, and 72°C for 
2 min] for 40 cycles, and 72°C for 10 min. 
Primer details are included in table S8. 


Immunizations 


To induce GC formation in draining lymph 
nodes, mice were immunized by subcutaneous 
injection of the highly substituted hapten NP 
conjugated to the carrier protein ovalbumin 
(OVA), NP(16-19)-OVA (Biosearch Technolo- 
gies N-5051), either into the hind footpad for 
popliteal lymph nodes collection, or into the 
base of tail for inguinal lymph nodes collection. 
We did not observe differences in GC response 
after NP-OVA immunization in popliteal ver- 
sus inguinal lymph nodes. In noncompetitive 
setup, non-B1-8" mice were injected with 50 ug 
of NP(16-19)-OVA. In competitive setup, recip- 
ients of B1-8™ adoptively transferred B cells 
were injected 24 hours after transfer with 20 ug 
of NP(I6-19)-OVA subcutaneously for T-dependent 
stimulation, or with 50 ug of NP(44)-Ficoll 
(Biosearch Technologies F-1420) intraperitoneal- 
ly for T cell-independent stimulation. Con- 
jugated NP-OVA or NP-Ficoll was absorbed 
in alum adjuvant (Thermo Fisher 77161) at a 
1:3 ratio of alum:immunogen prior to injection. 
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To induce germinal center (GC) formation in 
spleens, mice were immunized by intraperitoneal 
injection of 500 ml of 2% SRBCs resuspended 
in sterile 1X Dulbecco’s phosphate-buffered 
saline (DPBS) from a solution of sheep blood 
in Alsever’s (Cocalico Biologicals 20-1334A), or 
by intraperitoneal injection of 80 ug NP(30-32) 
conjugated to the carrier protein keyhole limpet 
hemocyanin (KLH) (Biosearch Technologies 
N-5060) absorbed in alum adjuvant (Thermo 
Fisher 77161) at a 1:1 ratio of alum:immunogen 
prior to injection. 

For the experiments with in vivo targeted 
delivery of T cell help, recipients of a mixture 
of B1-8"; DEC205*/* and B1-8"; DEC205-/~ 
adoptively transferred B cells were primed by 
intraperitoneal injection of 50 ug OVA (Bio- 
search Technologies O-1000) absorbed in alum 
at a 1:2 ratio of alum:immunogen 2 to 4 weeks 
prior to the transfer and received a boost im- 
munization by subcutaneous injection of 50 ug 
NP(19)-OVA without alum into the hind foot- 
pad 24 hours after transfer. At the indicated 
times at days 6 and 7 after boost, mice received 
5 ug of anti-DEC-OVA chimeric antibody di- 
luted in phosphate-buffered saline (PBS) by 
subcutaneous injection into the hind footpad, 
and popliteal lymph nodes were analyzed at 
day 9 after boost. Anti-DEC-OVA was pro- 
duced by transient transfection of HEK293T 
cells as previously described (70). 


Competitive adoptive cell transfers 


Spleens from mice of the indicated genotypes 
were mashed and filtered through a 40 um 
cell strainer. After treatment with red blood 
cell lysis solution (QIAGEN 158904), spleno- 
cytes were filtered into PBS supplemented 
with 0.5% bovine serum albumin (BSA) and 
2mM EDTA (PBE). Resting B cells were iso- 
lated by negative selection with magnetic cell 
separation using anti-CD43 beads (Miltenyi 
Biotec 130-049-801). To determine the propor- 
tion of NP-binding B1-8" cells, we stained a 
fraction of cells with 5 ug/ml NP(16)-PE (Bio- 
search Technologies N-5070) and analyzed 
them by flow cytometry. We then adoptively 
transferred a total number of isolated B cells 
corresponding to 0.3 x 10° to 0.5 x 10° NP- 
binding B cells per recipient, by intravenous 
injection through the retro-orbital venous sinus, 
at the proportions indicated in the experimen- 
tal schematic for each relevant figure. Alter- 
natively, for the transfer of non-B1-8"™ mature 
B cells, we directly adoptively transferred a 
total number of 20 million isolated B cells per 
recipient by intravenous injection through the 
retro-orbital venous sinus. 

To measure the proliferation rate of adop- 
tively transferred B cells responding to NP- 
Ficoll immunization, isolated resting B1-8" 
B cells from both genotypes were mixed at 
equal proportions of NP-binding cells, stained 
with CellTrace CFSE (Invitrogen C34554) at a 
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final concentration of 5 uM in PBS for 10 min 
at 37°C, and washed with PBE prior to injec- 
tion into recipients. 


Bone marrow and fetal liver cell transplantation 


For lymphomagenesis studies, we transplanted 
bone marrow cells (Bel2*/ ~Q36H cohort) or 
fetal liver cells (Bel2*/ “WTKI cohort) from do- 
nors of the indicated genotypes into lethally 
irradiated C57BL/6J females (two doses of 
450 rad, on a RS 2000 Biological Research 
X-ray Irradiator, Rad Source Technologies). 
Bone marrow cells were harvested from the 
tibias and femurs of donor mice, filtered 
through a 40 um mesh, treated with red blood 
cell lysis solution (QIAGEN 158904), counted, 
and frozen down as viable cells at a density of 
<15 million cells/ml. Genotype of donor mice 
was double checked by PCR and gel electro- 
phoresis before thawing and transplantation. 
Fetal livers were isolated from E14.5 embryos 
and dissociated between the frosted, sand- 
blasted ends of two sterile glass slides (VWR 
48312-024). Fetal liver cells were filtered through 
a 35 um mesh, counted, and frozen down as 
viable cells. Genotyping was performed on em- 
bryo head lysates prepared the same way as tail 
tip lysates, by PCR and gel electrophoresis. 
Bone marrow or fetal liver cells were then 
thawed and cultured at 5 million/ml for 2 hours 
at 37°C in RPMI with 10% fetal bovine serum 
(FBS), 50 U/ml penicillin, 50 ug/ml strepto- 
mycin, freshly added beta-mercaptoethanol 
at 55 uM, interleukin-3 (IL-3) at 6 ng/ml (R&D 
Systems 403-ML), SCF at 10 ng/ml (R&D Sys- 
tems 455-MC), IL-6 at 10 ng/ml (PeproTech 
216-16) and IL-7 at 5.4 ng/ml (R&D Systems 
407-ML). Cells were then collected, washed 
three times with IX DPBS, counted, and resus- 
pended in 1X DPBS for intravenous injection 
of 1 million bone marrow cells or 0.24 million 
fetal liver cells in 100 ul per lethally irradiated 
recipient, through the retro-orbital venous 
sinus. After 2 months, fully engrafted mice were 
immunized monthly by intraperitoneal injec- 
tion of 500 ml of 2% SRBC in sterile IX DPBS 
for 3 or 8 months, as specified, to induce GC 
formation. With the exception of mice euthan- 
ized at 8 months where indicated, all mice 
involved in lymphomagenesis studies were 
monitored until any one of several criteria for 
euthanizing were met, including severe lethargy, 
more than 10% body weight loss, or palpable 
splenomegaly that extended across the midline. 


Live and in situ mouse imaging 


Luciferase reporter imaging in live animals car- 
rying the R26)5-Bts1Q36H or R26! Bigiwt allele 
and in situ imaging was performed on an IVIS 
SpectrumCT In Vivo Imaging System instru- 
ment (PerkinElmer) using the Living Image 
software (PerkinElmer). Mice were injected 
intraperitoneally with 2.5 mg of luciferin sub- 
strate (200 ul at 12.5 mg/ml, Promega P1043) 
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and anesthetized with 2% isoflurane. Biolumi- 
nescence signal (radiance, in photons per sec- 
ond per square centimeter per steradian) was 
acquired 15 min after injection of luciferin. For 
each experiment, images were set to the same 
scale and total luminescence signal (flux, in 
photons per second) was quantified using 
one region of interest per mouse in the Living 
Image software. For in situ imaging, mice were 
euthanized, immediately injected intraperi- 
toneally with 2.5 mg of luciferin substrate, 
opened in the intraperitoneal area to expose 
the spleens and surrounding lymph nodes, and 
imaged within 2 min of euthanasia. 


H&E staining and immunohistochemistry 


Mouse organs were fixed in 10% neutral buf- 
fered formalin (Sigma) for 24 to 48 hours and 
transferred to 70% ethanol. Tissues were then 
embedded in paraffin, processed, and stained 
at the Laboratory of Comparative Pathology 
[Weill Cornell Medicine (WCM)/Memorial Sloan 
Kettering Cancer Center (MSKCC)]. Briefly, 5 um 
sections were deparaffinized and stained with 
hematoxylin and eosin (H&E) or processed 
for immunohistochemistry by heat antigen- 
retrieval in citrate buffer at pH 6.4, followed 
by treatment with 3% hydrogen peroxide in 
methanol to block endogenous peroxidase 
(HRP) activity. Indirect immunohistochem- 
istry was then performed using biotinylated 
peanut agglutinin (PNA, Vector Laboratories 
B1075) or anti-species-specific biotinylated 
secondary antibodies, followed by avidin- 
horseradish peroxidase and development with 
DAB substrate (Vector Laboratories). Sections 
were counterstained with hematoxylin. The 
following primary antibodies were used: anti- 
B220 (BD 550286) and anti-Ki67 (Cell Signal- 
ing Technology 12202). Slides were scanned 
using a Zeiss Mirax Slide Scanner, and photo- 
micrographs were examined using Aperio eSlide 
Manager (Leica Biosystems). QuPath software 
(71) was used to quantify GC areas. 


Flow cytometry analysis and cell sorting 


Spleens were mashed and bone marrow cells 
were flushed from tibias and femurs. Spleno- 
cytes and bone marrow cells were filtered 
through a 40 pm cell strainer and treated with 
red blood cell lysis solution (QIAGEN 158904). 
Resulting single-cell suspensions were resus- 
pended in PBE with 0.5 ug/ml rat anti-mouse 
CD16/CD32 (clone 2.4G2, BD) and incubated 
for =>5 min on ice to block Fc receptors. Lymph 
node cells were directly mashed into the Fc 
block mix using pellet pestles (Sigma Z359947). 
Peritoneal B cells were collected by injecting 
9 ml of PBS with 0.5% BSA into the peri- 
toneal cavity followed by a subcostal incision 
and aspiration of the peritoneal cell suspen- 
sion. After centrifugation, cells were stained 
for 20 min on ice, in a mix of fluorochrome- 
conjugated anti-mouse antibodies diluted either 
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in PBE, or in a 50:50 mixture of Brilliant Stain 
Buffer (BD 566349) and PBS with 0.5% BSA 
but without EDTA when the mix contained 
>2 BV- or BUV-conjugated antibodies, to avoid 
staining artifacts resulting from interaction 
between these fluorescent dyes. For phosphoryl- 
ated S6 (pS6) staining, incubation was pre- 
cisely timed to prevent time-related variations 
in dephosphorylation by endogenous phos- 
phatases, as previously described (5). Cells 
were first stained for surface markers and 
then fixed and permeabilized. Fixation and 
permeabilization using Cytofix/Cytoperm (BD 
554722) was timed to occur exactly 30 min 
after euthanasia for each mouse. We then 
stained with anti-pS6 diluted in Perm/Wash 
(BD 554723) for 2 hours. All cells were then 
washed twice before analysis. DAPI (4’,6- 
diamidino-2-phenylindole; Thermo Fisher 
D1306) was added just before acquisition at 
final 0.5 ug/ml for the exclusion of dead cells. 
Data were acquired on BD FACS Canto II, BD 
Fortessa, BD Symphony A5 or BD Symphony 
A3 and analyzed using FlowJo software pack- 
age (TreeStar). 

For MYC flow cytometry analyses in cell 
lines, two or three independently generated 
and validated SU-DHLI4 stable lines for each 
indicated genotype were set at 1 million cells 
per ml in fresh complete RPMI and incubated 
at 37°C for 46 hours, for cells to have just 
doubled. For MYC flow cytometry analyses in 
human primary GC B cells freshly immortal- 
ized, cells were plated at 1 million cells per ml 
onto fresh feeders and incubated at 37°C for 
3 or 4 days. MG132 (Selleck S2619) was then 
added to cells at 5 uM final MG132. Samples 
were collected at the indicated times after 
treatment and immediately fixed for 1 hour 
at room temperature or overnight at 4°C for 
all samples, depending on experiments (this 
did not affect results) using the True-Nuclear 
Transcription Factor Buffer Set (Biolegend 
424401). Cells were then washed twice in 1X 
perm buffer and kept at 4°C. Fixed and per- 
meabilized cells were blocked with 2% normal 
goat serum (Cell Signaling Technology 5425) 
diluted in perm buffer, for 15 min at room tem- 
perature. We then stained cells with AF647 
anti-MYC (rabbit, clone Y69, Abcam ab190560, 
diluted to 0.71 ug/ml) for 30 min at room 
temperature in the dark. Alternatively, we 
stained cells with unconjugated anti-MYC 
(rabbit, clone Y69, Abcam ab32072, diluted 
to 0.45 ug/ml), washed twice with perm buffer, 
and incubated with AF647 anti-rabbit second- 
ary antibody (Life Technologies A21246, diluted 
to 2 ug/ml) for another 30 min at room tem- 
perature in the dark. We used the following 
isotype control IgGs at the same concentration 
as their corresponding anti-MYC antibodies: 
Abcam ab199093, clone EPR25A, for the AF647 
anti-MYC antibody; and Abcam ab172730, 
clone EPR25A, for the unconjugated anti- 
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MYC antibody. After two washes in perm buf- 
fer, we resuspended cells in perm buffer and 
analyzed them. 

For cell sorting from spleen and lymph nodes, 
cell suspensions were prepared and stained as 
for flow cytometry analysis, except that spleno- 
cytes were pre-enriched either for B220* B cells 
by positive selection with anti-B220 magnetic 
beads (Miltenyi Biotec 130-049-501) or for GC 
B cells by positive selection with PNA mag- 
netic beads (Miltenyi Biotec 130-110-479) where 
specified. The indicated populations were then 
isolated on a BD FACSAria II or a BD Influx 
sorter at the Flow Cytometry core (WCM) or in 
the laboratory. 

The following antibodies were used: APC- 
Cy7 anti-B220 (clone RA3-6B2, BioLegend 
103224, dilution 1:500), BV786 anti-B220 (clone 
RA3-6B2, BD 563894, dilution 1:300), FITC 
anti-human CD2 (clone RPA-2.10, BD 556608, 
dilution 1:100), BV605 anti-CD5 (clone 53-7.3, 
BD 563194, dilution 1:500), APC-Cy7 anti-CD19 
(clone 6D5, BioLegend 115530, dilution 1:200), 
FITC anti-CD19 (clone 1D3, BD 553785, dilu- 
tion 1:500), PE-Cy7 anti-CD21 (clone 7E9, Bio- 
Legend 123420, dilution 1:500), FITC anti-CD23 
(clone B3B4, BD 553138, dilution 1:500), PE-Cy7 
anti-CD25 (clone PC61, BD 561780, dilution 
1:500), APC anti-CD38 (clone 90, eBioscience 
17-0381-81, dilution 1:500), BUV395 anti-CD38 
(clone 90, BD 740245, dilution 1:500), BUV395 
anti-CD45.1 (clone A20, BD 565212, dilution 
1:200), PerCP-Cy5.5 anti-CD45.1 (clone A20, 
eBioscience 45-0453-82, dilution 1:200), AF647 
anti-CD45.2 (clone 104, BioLegend 109818, 
dilution 1:500), APC anti-CD45.2 (clone 104, 
BioLegend 109814, dilution 1:400), PE-Cy7 
anti-CD45.2 (clone 104, eBioscience 25-0454- 
82, dilution 1:300), biotin anti-CD83 (clone 
Michel-19, BioLegend 121504, dilution 1:200), 
BV605 anti-CD86 (clone GL-1, BioLegend 
105037, dilution 1:200), PE-Cy7 anti-CD86 
(clone GL-1, BioLegend 105014 or BD 560582, 
dilution 1:200), BUV737 anti-CD138 (clone 
281-2, BD 564430, dilution 1:500), APC anti-c- 
Kit/CD117 (clone 2B8, BD 561074, dilution 
1:500), biotin anti-CXCR4 (clone 2B11, BD 
551968, dilution 1:200), PE anti-CXCR4 (clone 
2B11, BD 561734, dilution 1:100), BV421 anti- 
FAS (clone Jo2, BD 562633, dilution 1:300), 
PE-Cy7 anti-FAS (clone Jo2, BD 557653, di- 
lution 1:500), AF647 anti-GL7 (clone GL7, BD 
561529, dilution 1:500), AF647 anti-MYC (rab- 
bit, clone Y69, Abcam ab190560, diluted to 
0.71 ug/ml), unconjugated anti-MYC (rabbit, 
clone Y69, Abcam ab32072, diluted to 0.45 ug/ 
ml) and AF647 anti-rabbit secondary antibody 
(Life Technologies A21246, diluted to 2 ug/ml), 
BV510 anti-IgD (clone 11-26c.2a, BD 563110, 
dilution 1:500), PE anti-IgD (clone 11-26c.2a, 
BD 558597, dilution 1:500), PerCP-Cy5.5 anti- 
IgD (clone 11-26c.2a, BioLegend 405710, dilu- 
tion 1:500), APC anti-IgG1 (clone A85-1, BD 
560089, dilution 1:500), BV421 anti-IgG1 (clone 
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A85-1, BD 562580, dilution 1:500), BV650 anti- 
IgG1 (clone RMG1-1, BioLegend 406629, dilution 
1:500), APC anti-IgM (clone II/41, eBioscience 
17-5790-82, dilution 1:500), BV711 anti-IgM (clone 
II/41, BD 743327, dilution 1:500), BV421 anti- 
Ig lambda light chain (A1, 12, 13, clone R26-46, 
BD 744523, dilution 1:400), AF647 anti-pS6 
(S6 phosphorylated at S235/236, clone D57.2.2E, 
Cell Signaling Technology 4851, dilution 1:200), 
APC streptavidin (BioLegend 405207, dilu- 
tion 1:300) and PE-Cy7 streptavidin (eBio- 
science 25-4317-82, dilution 1:400). 

To assess progression through S phase of the 
cell cycle, we performed dual nucleotide pulse 
and staining as previously described (46). 
Briefly, mice were injected intravenously with 
1 mg of EdU (Thermo Fisher E10187) and 
1 hour later with 2 mg BrdU (Sigma B5002). 
Thirty minutes after the second injection, 
lymph nodes were harvested, and single cell 
suspensions were prepared. After cell surface 
staining as described above, cells were fixed 
and permeabilized using BD Cytofix/Cytoperm 
and BD Permeabilization Buffer Plus. EdU and 
BrdU incorporation into DNA was assayed 
using the Click-iT EdU Pacblue Flow Cytom- 
etry Assay Kit (Thermo Fisher C10418) and 
AF647 mouse anti-BrdU (clone 3D4, BD 560209), 
respectively. 


EdU staining on a slide 


Mice were injected intravenously through the 
retro-orbital venous sinus with 1 mg EdU 
(200 pl at 5 mg/ml) 1 hour before euthanasia. 
Spleens were processed and splenocytes stained 
and sorted as described above. Myc“"?* and 
Myc“"?- GC B cells were sorted on a BD 
FACSAria II into PBE and immediately spun 
onto positively charged glass slides (VWR 
48311-703) by cytospin using an Aerospray 
Pro slide stainer/cytocentrifuge (ELITechGroup) 
and left to air dry for 2 hours. Cells were fixed 
with 4% paraformaldehyde for 15 min at room 
temperature, washed three times with PBS, 
and left to dry overnight at room tempera- 
ture. Cells were then washed using PBS plus 
3% BSA and permeabilized using 0.5% triton- 
X for 20 min. EdU was detected using Click-iT 
Plus EdU Cell Proliferation Kit for Imaging, 
AF647 dye (Thermo Fisher C10640). This in- 
cluded a staining step with DAPI at 1 ug/ml 
final. Aqueous mounting media (Fluoromount, 
Sigma F4680) and coverslips (VWR micro 
cover glass) were applied to slides right 
after staining, and imaging was performed 
immediately. 

Images were acquired on a Zeiss LSM 880 
laser scanning confocal microscope with a 10x 
objective lens using 405 nm and 633 nm laser 
excitation for DAPI and AF647, respectively, 
using the Zen software (Zeiss). Alternatively, 
images were acquired on an Evos FL Digital 
inverted fluorescence microscope (Life Techno- 
logies) with a 10x objective lens, a high-sensitivity 
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monochrome camera (Sony ICK285AL CCD) 
and DAPI and Cy5 Evos light cubes to detect 
DAPI and AF647, respectively. Representative 
images shown were acquired on the Evos in- 
strument. All images were viewed and pro- 
cessed using the Fiji software (72). Cyan and 
magenta lookup tables (LUTs) were applied 
to DAPI and AF647 channels, respectively, for 
visualization. 


Generation of stable DLBCL cells 


All constructs (EFla-hBTG1, EFla-hBTG1?*™", 
EFla-EV, EFla-hBTGI-V5, EFla-hBTG1?""-v5, 
and EFla-EGFP-V5) were cloned into the pLEX_ 
307 vector (a gift from D. Root, Addgene 41392), 
which drives constitutive lentiviral expression 
of the open reading frames (ORFs) as fusion 
proteins with a C-terminal V5 epitope tag 
under the control of the EF1-alpha promoter, 
as well as puromycin resistance. ORFs with a 
stop codon were used to express nontagged 
proteins, and ORFs lacking a stop codon were 
used to express V5-tagged proteins. Human 
BTGI1 ORF with stop codon was obtained in 
pENTR221 vector (Transomic) and used to 
PCR amplify BTG1 ORF without stop codon. 
Q36H mutation was introduced into BTG1 
with or without stop codon by site-directed 
mutagenesis using Phusion HF polymerase 
(NEB) followed by DpnI digestion. An empty 
control vector (EV) was generated by NotI/ 
AsclI double digestion of an ORF-containing 
pENTR/D-TOPO vector, followed by blunt- 
ing with T4 DNA polymerase and ligation. 
EGFP ORF without stop codon was obtained 
in pENTR221 [a gift from D. Root, Addgene 
25899 (73)]. All of the above entry vectors 
were recombined into the pLEX_307 desti- 
nation vector using the Gateway LR Clonase 
II enzyme mix (Invitrogen 11791-020). T39I, 
S43N, and A49T mutations were introduced 
into BTGI1-V5 by site-directed mutagenesis 
using Phusion HF polymerase (NEB) followed 
by DpnI digestion. Identity of all vectors was 
verified by Sanger sequencing (Genewiz). 

For lentivirus production, 2.5 million 
HEK293T cells cultured in complete Dulbecco’s 
modified Eagle medium [DMEM (Thermo 
Fisher 11965118); 10% FBS, 2 mM glutamine, 
50 U/ml penicillin, and 50 pg/ml streptomycin] 
were plated in a 10-cm-diameter plate for 
24 hours before cotransfection with the ex- 
pression construct, pMD2.G, and psPAX2 vec- 
tors at a 4:1:3 ratio using Fugene HD (Promega 
E2311) for a total of 8 ug DNA per plate. After 
12 to 16 hours, medium was replaced. Virus- 
containing medium was collected at 48 and 
72 hours after transfection, pooled, filtered, 
concentrated using PEG-it (System Biosciences 
LV825A-1), and resuspended at 20X in IX DPBS. 
Virus was used right away or stored aliquoted 
at -80°C. 

SU-DH14 cells cultured in complete RPMI 
(10% FBS, 2 mM glutamine, 50 U/ml penicillin, 
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50 ug/ml streptomycin, and 10 mM HEPES) 
were transduced at a multiplicity of infection 
of 1 by adding concentrated virus to freshly 
plated cells. After 72 hours incubation, medium 
was replaced with fresh medium containing 
1 ug/ml puromycin, and cells were selected 
for 7 days, with fresh addition of puromycin- 
containing medium every other day. Expres- 
sion of the constructs was validated by qRT-PCR 
and Western blot (WB) from an equal number 
of cells from cell suspension cultures. 


Generation of stable human primary GC B cells 


Fresh human tonsils were collected from the 
Department of Pathology at Weill Cornell/New 
York Presbyterian (IRB protocol 0804009763). 
To generate human primary GC B cells stably 
expressing BTG1™” versus BTG12*", we adapted 
a procedure previously described in detail (43). 
This involved isolating GC B cells from human 
tonsils, immortalizing them through BCL2/ 
BCL6 retroviral transduction (the CD2* marker 
was used to identify immortalized cells), and 
stably expressing V5-tagged BTGI™’, BTGI@*", 
or EGFP through lentiviral transduction and 
puromycin selection (fig. SIID). 

Tonsillar GC B cells were purified using 
human B cell negative selection isolation Kit IT 
(Miltenyi Biotec 130-051-151) with addition of 
IgD-BIOT (SouthernBiotech 9030-08, dilution 
1:100) and CD44-BIOT (SouthernBiotech 9400- 
08, dilution 1:100) antibodies to remove naive 
and memory B cells. Purified GC B cells were 
cultured in RPMI (20%, 2 mM glutamine, 
50 U/ml penicillin, 50 pg/ml streptomycin) 
at a concentration of 1 million cells per ml, on 
irradiated (30 Gy) YK6-CD40Lg-IL21 feeder 
cells that carry puromycin resistance (0.2 mil- 
lion cells per ml). GC B cells were replated 
every 3 to 4 days on freshly irradiated YK6 
feeder cells. 

For virus production, Lenti-X 293T cells 
were plated in a 10-cm? dish 16 hours before 
transfection at 0.2 million cells per ml in DMEM 
(10%, 2 mM glutamine, 50 U/ml penicillin, 
50 ug/ml streptomycin). For retrovirus pro- 
duction (BCL2-BCL6), vectors were mixed as 
followed: 1 ug of pHIT6O (gag-pol), 1 ug of 
phCMV-GaLV MTR [a gift from D. Hodson, 
Addgene 163612 (74)] and 4 ug of MSCV-BCL6- 
t2A-Bcl2 [a gift from D. Hodson, Addgene 
135305 (74)] and incubated for 45 min in 1 ml 
Opti-MEM with 18 ul of Trans-IT (Mirus Bio 
MIR 2300) before addition to the Lenti-X 293T 
dish. For lentivirus production (V5-tagged 
BIG1"", BTG1°%**", EGFP), 8.3 ug of gag-pol 
vector pCMVDeltaR8.91, 2.8 ug of envelop vec- 
tor phCMV-GaLV MTR [a gift from D. Hodson, 
Addgene 163612 (74)], and 11 ug of BTGI1 con- 
structs in pLEX_307 vector (described above 
for cell lines) were incubated for 45 min in 1 ml 
Opti-MEM with 33 ul of Trans-IT (Mirus Bio, 
MIR 2300) and added to the Lenti-X 293T dish. 
Viruses were collected and filtered (0.45 um) 
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72 hours later and frozen or directly used for 
infection of GC primary cells. 

For infection, 2 million GC B target cells 
with feeder cells were directly resuspended 
in virus suspension at a virus:media ratio of 
1:4 for retrovirus or 3:4 for lentivirus, with 
addition of 8 ng/ml of polybrene (Santa Cruz 
sc-134220) and 25 uM HEPES. Cells and virus 
mixtures were plated in 12-well plates and 
centrifuged at 1500g for 90 min at 32°C. Virus- 
containing medium was replaced with fresh 
medium (RPMI 20%, 2 mM glutamine, 50 U/ml 
penicillin, 50 ug/ml streptomycin) immedi- 
ately after centrifugation for retroviral infec- 
tion and at least 4 hours after centrifugation 
for lentiviral infection (43). Seventy-two hours 
after lentiviral infections, cells were treated 
with 1 ug/ml puromycin for 10 days to select 
for transduced cells. BCL2-BCL6 positivity was 
continuously monitored by anti-CD2 flow cyto- 
metry. Experiments were performed when 
>95% puromycin-selected cells were obtained 
within CD2* GC B cells. 


mRNA stability assay 


Three independently generated and validated 
SU-DHL4 stable lines for each indicated geno- 
type were set at 1 million cells per ml in fresh 
complete RPMI and incubated at 37°C for 
24 hours, for cells to be growing exponen- 
tially and at similar medium composition at 
the time of treatment. Cells were then collected 
(0 hours) or actinomycin D (Cell Signaling 
Technology 15021) was added at 10 ug/ml final 
and cells were collected at the indicated time 
points after treatment. Upon collection, cells 
were spun and directly resuspended in TRIzol 
for RNA extraction, followed by cDNA synthe- 
sis and qRT-PCR analysis as described above. 


RNA immunoprecipitation (RIP) 


Two or three independently generated and 
validated SU-DHIA4 stable lines were used for 
each indicated genotype per experiment. Thirty- 
five million cells per sample were set at 1 million 
cells per ml in fresh complete RPMI and 
incubated at 37°C for 24 hours, for cells to be 
growing exponentially and at similar medium 
composition at the time of treatment. Cells 
were collected, washed twice with cold 1X 
DPBS, and lysed in 175 ul Magna RIP lysis 
buffer (from Millipore 17-700) with frequent 
vortexing for 10 to 30 min on ice. Lysates 
were directly processed for RIP using mag- 
netic beads, following the Magna RIP kit 
instructions (Millipore 17-700) or kept at —-80°C 
until processing. Anti-V5 mouse antibody 
(7.5 ug; Abcam 27671) was used to pull down 
BIGI""-V5, BTGI?**"-V5, or EGFP-V5 (nega- 
tive control). Ten percent of lysate was saved 
pre-RIP as input RNA, 175 ul lysate was RIPed, 
and the remaining lysate volume was saved for 
protein quantification by Bradford to analyze 
by Western blot as input protein. Post-RIP, 6% 
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of the beads suspension was saved during the 
last wash to analyze by Western blot as RIP 
protein. RNAs were purified from the beads 
and input RNA samples according to the 
Magna RIP kit instructions and resuspended 
in 25 wl molecular grade RNase-free water. 
Equal RNA amounts were then used for cDNA 
synthesis and qRT-PCR as described above. 
In the first experiment, RIP was done in 
technical duplicates for each independent line 
(35 million cells each, two independent lines 
per genotype). We observed highly similar pull- 
down efficiency between technical duplicates 
(compare a and b replicates, fig. S9A), and thus 
subsequent experiments were done with three 
independent lines and one RIP per genotype. 


Western blot 


Three independently generated and validated 
SU-DHL4 stable lines for each indicated geno- 
type were set at 1 million cells per ml in fresh 
complete RPMI and incubated at 37°C for 
24 hours, for cells to be growing exponen- 
tially and at similar medium composition 
at the time of treatment. For the starvation 
analyses, after the initial 24 hours incubation, 
cells were resuspended in fresh complete 
RPMI without FBS, incubated for 4 hours 
at 37°C, followed by resuspension in fresh 
complete RPMI (with 10% FBS) for 15 min, 
30 min, 1 hour, and 2 hours, at which times 
cells were collected, spun, and directly lysed in 
lysis buffer (50 ul per million cells set up). For 
the newly protein synthesis analyses, after the 
initial 24 hours incubation, DMSO or MG132 
(Selleck S2619) was added to cells for 5 uM 
final MG132 (and 0.01% DMSO final for all 
samples), and samples were collected at the 
indicated times after treatment, spun, and 
directly lysed in lysis buffer (50 ul per million 
cells plated). For the protein stability analyses, 
after the initial 24 hours incubation, CHX 
(stock kept in —80°C at 20 mg/ml in H,O, 
Santa Cruz Biotechnology 66-81-9) was added 
to cells at 40 ug/ml final, and samples were 
collected at the indicated times after treat- 
ment, spun, and directly lysed in lysis buffer 
(50 ul per million cells plated). After develop- 
ment and signal quantification as described 
below, nonlinear regression curves were gen- 
erated using exponential one phase decay equa- 
tion with plateau constraint to zero. 

Lysis buffer composition was 20 mM HEPES 
KOH pH 7.5, 50 mM b-glycerophosphate, 1 mM 
EDTA pH 8.0, 1 mM EGTA pH 8.0, 0.5 mM 
Na3VO4, 100 mM KCl, 10% glycerol, and 1% 
Triton X-100, supplemented with complete 
protease inhibitor cocktail (Roche). Protein con- 
centration was determined by Bradford (Thermo 
Fisher 23236) and equal protein amounts were 
separated by 10 or 12% SDS-polyacrylamide 
gel electrophoresis. After electrophoretic trans- 
fer to BioTrace NT pure nitrocellulose blotting 
membrane (PALL 66485), membranes were 


Mlynarczyk et al., Science 3'79, eabj7412 (2023) 


stained with Ponceau, rinsed and blocked with 
5% nonfat dry milk (Biorad 1706404) unless in- 
dicated otherwise, in PBS containing 0.1% 
Tween-20. Proteins were probed by overnight 
incubation at 4°C with the following anti- 
bodies: anti-V5 for Western blot after RIP 
(rabbit, Sigma V8137, dilution 1:2000), anti-V5 
for all other Western blots (mouse, abcam 
27671, dilution 1:2500), anti-alpha-tubulin 
(mouse, Sigma T6199, dilution 1:5000), anti- 
beta-actin (mouse, Sigma 5441, dilution 1:15,000), 
anti-MYC (rabbit, Cell Signaling Technology 
5605, 1:1000), or anti-RPS3 (rabbit, Cell Signal- 
ing Technology 9538, 1:3000 in 5% BSA). 
Membranes were then incubated with HRP- 
conjugated anti-mouse or anti-rabbit second- 
ary antibody (Cell Signaling Technology 7076 
or 7074) or with IRDye 800CW anti-rabbit 
antibody (LI-COR 926-32211) for RPS3 de- 
tection. Development was done with ECL 
(Thermo Fisher 32106) for HRP-conjugated 
antibodies or directly for IRDye 800CW, 
either on a ChemiDoc Touch imaging system, 
ChemiDoc MP Imaging System (BioRad) or 
using Amersham Hyperfilm (Thermo Fisher 
45001508) and a developer. Protein band in- 
tensities were quantified using the chemi- 
luminescence images acquired on the ChemiDoc 
and the Image Lab software v6.1.0 (Bio-Rad). 


Cell volume measurement on Coulter counter 


Three independently generated and validated 
SU-DHLIA4 stable lines for each indicated geno- 
type were set at 1 million cells per ml (in trip- 
licates or quadruplicates) in fresh complete 
RPMI and incubated at 37°C for 24 hours. 
Cells were then resuspended, and an aliquot 
of cell suspension was used immediately for 
cell volume (mean fL) analysis on a Multisizer 
4e Coulter particle analyzer (Beckman Coulter 
B43905). 


Polysome profiling 


Three independently generated and validated 
SU-DH14 stable lines for each indicated geno- 
type were set at 1 million cells per ml in fresh 
complete RPMI and incubated at 37°C for 
48 hours. Equal numbers of cells for each 
line (30 million or 100 million cells, depending 
on experiment replicate) were treated with 
CHX at 100 pg/ml for 10 min at 37°C. Cells 
were then lysed on ice for 10 min in 400 ul 
lysis buffer containing 20 mM Tris HCl pH 
7.4, 100 mM KCl, 5 mM MgClo, 1% TritonX- 
100, 100 ug/ml CHX, 10 mM DTT, EDTA-free 
protease inhibitors (Roche 11873580001,) and 
500 U/ml RNasin RNase inhibitor (Promega 
N2611). Lysates were cleared by centrifugation 
at 15,000g, 4°C for 15 min. Cleared lysate 
aliquots were saved as inputs, and remaining 
cleared lysates were loaded onto a linear 20 to 
50% sucrose gradient in 40 mM Tris HCl pH 
7.5, 300 mM KCl, 20 mM MgCly, and 100 ug/ml 
CHX. Samples were then centrifuged at 
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42,000 rpm for 3 hours at 4°C using a TH641 
rotor (Sorvall) in a Sorvall wX+ Ultra Series 
Centrifuge (Thermo Scientific). After centrif- 
ugation, fractions were collected using a 
BioComp Gradient Station ip. RNA content 
was measured at A260 concurrent with frac- 
tion collection using Triax Flow Cell v1.50A 
software. AUCs and analyses thereof were 
calculated using Prism 8 software. 

After collection, fractions were prepared for 
RNA precipitation by adding one-tenth of frac- 
tion volume of 3 M NaOAc and 1 volume of 
fraction volume of isopropanol and incubated 
at -80°C overnight. Precipitated RNAs were 
extracted and purified using the Direct-zol 
RNA Miniprep kit and protocol (Zymo Research, 
R2050). After adding ERCC spike-ins (Thermo 
Fisher 4456740) per same volume of RNA for 
all samples (input and fractions) to be able to 
normalize for potential differences in effi- 
ciency of RT or qPCR amplification, we used 
equal volumes of samples for RT and qPCR. 
For MYC, we generated a standard curve using 
MYC dsDNA amplicons at 10° to 10° absolute 
copy number per well. Details about primers 
used are included in table S8. 

For WB analysis, fractions were prepared 
for protein precipitation by adding cold tri- 
chloroacetic acid (TCA) at 15% final and incu- 
bated at —20°C overnight. Samples were thawed 
on ice and centrifuged for 30 min at 15,000g, 
4°C. Pellet was resuspended in 1 ml of cold 
15% TCA to fully remove sucrose and salt 
and centrifuged again for 15 min at 15,000g, 
4°C. The pellet was washed twice with 500 ul 
acetone and centrifuged for 10 min at 15,000g, 
4°C. Samples were then dried using speedvac 
and resuspended in 1X LDS (Thermo Fisher 
B0007) diluted in RIPA buffer [50mM HEPES 
pH 7.4, 150 mM NaCl, 1% sodium deoxycholate, 
0.1% SDS, 10 mM sodium pyrophosphate, 
10 mM b-glycerophosphate, 1% NP-40, supple- 
mented with 1 mM MgCl, phosphatase inhibitor 
cantharidin (Gold Biotechnology C-975-500), 
benzonase (EMD Millipore 71205-3)] and pro- 
tease inhibitor cocktail as follows: 115 mM so- 
dium molybdate dihydrate (Gold Biotechnology 
M-781-100), 100 mM sodium orthovanadate 
(Gold Biotechnology S-930-25), 100 mM sodium 
fluoride (Alfa Aesar AAA1301930), 200 mM 
imidazole (Gold Biotechnology I-902-25), and 
400 mM sodium tartrate dihydrate (Alpha Aesar 
AAA1618730). Samples were then boiled at 75°C 
for 15 min and analyzed by Western blot. 


UV cross-linking and immunoprecipitation (CLIP) 
and RNA radiolabeling 


Immunoprecipitation and radiolabeling of RNAs 
bound to V5-tagged proteins were performed 
as previously described (75) with the following 
modifications: 20 million cells were seeded 
24 hours prior to cross-linking. Cells were 
centrifuged at 300g for 5 min, washed in 10 ml 
of 0.22 um filtered media, then centrifuged 
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again and resuspended at a final concentra- 
tion of 4 million cells per ml in 0.22 um filtered 
media. Cells were then cross-linked with 
150 mJ/em? at 254 nm using a 254 nm UV 
cross-linker instrument (HL-2000 HybriLinker, 
UVP) and centrifuged, and pellets snap frozen 
on dry ice. 

Cross-linked cell pellets were resuspended 
in cold lysis buffer: 50 mM Tris-HCl pH 7.5, 
300 mM NaCl, 0.5% Nonidet-P40 (IGEPAL 
CA-630), supplemented with complete EDTA- 
free protease inhibitor (1 tablet per 25 ml, 
Roche 11873580001) and 1X phosphatase in- 
hibitor (100X Halt cocktail, Thermo 8440) 
prior to use. Cells in lysis buffer were sonicated 
using a Branson Digital Sonifier Model 450 
fitted with 3.125-mm tapered micro tip probe 
on ice at 10% amplitude (2-s ON, 10-s OFF 
cycle, total 30 s). A Western blot was per- 
formed on lysate aliquots to quantify the levels 
of V5-tagged proteins. Western blot lysates 
were first treated with 5 U/ul RNasel for 
10 min at 37°C to completely digest RNA. Cell 
lysates were normalized according to the levels 
of V5-tagged proteins. DNase I (0.002 U/ul; 
Thermo Scientific EN0521) and RNase I 
(0.02 U/ul; Thermo Scientific AM2294) were 
used to digest DNA and RNA in the cell lysate 
for 3 min at 37°C . 

Anti-V5 antibody (2.5 ug; mouse, Abcam 
27671, dilution 1:2500) was used for immuno- 
precipitation (IP) of cross-linked RNA-protein 
complexes per sample. V5 antibodies were 
first bound to Pierce Protein A/G Magnetic 
Beads (Thermo Scientific 88802) in lysis buffer 
for 1 hour at 4°C. IP was performed in lysis 
buffer at 4°C overnight. As previously de- 
scribed (75), bound RNA-protein complexes 
were washed twice with stringent high-salt 
buffer (50 mM Tris-HCl pH 7.4, 1 M NaCl, 
1mM EDTA, 1% NP-40, 0.1% SDS, 0.5% so- 
dium deoxycholate) and twice with low-salt 
buffer (20 mM Tris-HCl pH 7.4, 10 mM MgCl, 
0.2% Tween-20). Half the IP lysate was used 
for Western blot to verify levels of V5-tagged 
proteins (anti-V5, Sigma V8137, dilution 1:2000). 
Western blot lysates were treated with 0.2 U/ul 
RNasel for 10 min at 37°C to completely digest 
RNA. In the remaining half of the IP lysate, 
RNA bound to pulled down proteins was la- 
beled with 0.25 U/ul T4 PNK (New England 
Biolabs M0201) and 0.25 uCi/ul [y-32P]- ATP 
(Perkin Elmer BLU002Z500UC). RNA-protein 
complexes were eluted from the beads in 1X 
NuPage LDS Sample Buffer (Invitrogen NPO0008) 
plus 10 mM DTT and run on an SDS-PAGE 
gel, before transfer onto a nitrocellulose mem- 
brane. The membrane was then exposed using 
autoradiograph film. 


RNA-, RIP-, and BCR-sequencing sample preparation 


RNA was extracted using TRIzol as described 
above, quantified with Qubit, and analyzed for 
integrity using RNA Pico Chips (Agilent 5067- 
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1513) on an Agilent 2100 Bioanalyzer (Agilent 
Technologies). Samples with RNA integrity 
number (RIN) = 8 were further processed. On 
the basis of the available amount, 2 to 100 ng 
RNA was used for cDNA synthesis and qRT- 
PCR to validate samples identity pre-library 
preparation. Equal amounts of RNA were used 
for all samples for each experiment for library 
preparation. Libraries were validated on the 
Agilent 2100 Bioanalyzer before sequencing 
and quantified using Quant-iT dsDNA HS As- 
say (Thermo Fisher Q33120). Each library was 
generated with a unique index sequence, and 
libraries were pooled with the same ratio for 
all samples for sequencing, except for the RIP 
as described below. Library preparation, se- 
quencing and postprocessing of raw data was 
performed at the Epigenomics Core and Ge- 
nomics Resources Core facilities (WCM). 

For SU-DHI4 RNA-seq, three independent 
stable lines were freshly generated as described 
above, selected with 1 ug/ml puromycin for 
4 days, enriched for viable cells by gradient 
separation (Fico/Lite-LymphoH, R&D Systems 
140150), and 200,000 viable cells were used for 
RNA extraction. For library preparation, 200 ng 
RNA per sample was used. 

For GC B cell bulk RNA-seq, live B220*FAS* 
GL7* cells were sorted from B220* enriched 
splenocytes on BD Influx as described above, 
from four CRE and five Q36H mice. Per sam- 
ple, 123,000 to 344,000 GC B cells were sorted, 
of which 120 to 236 ng RNA was extracted. 

For both SU-DHL4 and GC B cell bulk RNA- 
seq, libraries were prepared using Illumina 
TruSeq stranded-mRNA sample kit and sub- 
jected to single read 50 bp sequencing on an 
Illumina HiSeq 2500. 

For Myc@?* and Myc“ GC B cells RNA-seq, 
live B220*FAS*CD38° Myc“?* or Myc" cells 
were sorted from B220* enriched splenocytes 
on BD Influx as described above, from two 
CRE and three Q36H mice. Per sample, 4000 
to 8850 Myc“"?* GC B cells and 155,000 to 
250,000 Myc“"* GC B cells were sorted, of 
which 1.9 to 3.8 ng (Myc“?*) and 60 to 300 ng 
(Myc@??-) RNA was extracted. Eleven PCR 
cycles and 0.95 ng RNA were used for all sam- 
ples for library preparation using the Ultra- 
low input RNA kit (Takara SMART-Seq v4), 
and samples were subjected to single read 
50 bp sequencing on an Illumina HiSeq. 

For Bcl2+Q36H and Bcl2 lymphoma B cells 
RNA-seq, 300,000 GC B cells from Bcl2 and 
Bcl2+Q36H mice were sorted as live B220* 
FAS*GL7' cells from PNA* enriched splenocytes 
on BD Influx or BD FACSAria II at 8 months 
after transplantation. Per sample, 150 ng RNA 
was used for library preparation using Illu- 
mina TruSeq stranded-mRNA sample kit fol- 
lowed by single read 50 bp sequencing on an 
Illumina HiSeq. 

For SU-DHL4 RIP-sequencing, input and 
RIP RNAs were obtained from two indepen- 
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dently generated stable lines, each RIPed in 
duplicates from 35 million cells each as de- 
scribed above. We recovered ~7 to 14 ug RNA 
from input samples and 500 to 1700 ng RNA 
from RIP samples. We added ERCC RNA spike- 
ins (Thermo Fisher 4456740) per same volume 
of RNA for all input or for all RIP samples 
to be able to normalize for possible global 
changes in starting RNA amount and pull 
down RNA amount between samples. Of these, 
we used 250 ng RNA in 50 ul for each of the 
input and RIP samples for library prepara- 
tion using TruSeq-stranded-mRNA sample 
kit. We used 22 nmol library for all input and 
RIP samples, except for the EGFP-V5 RIP 
samples, for which we added a maximum of 
11 nmol each (2:1 ratio, normalized during 
analysis). Samples were subjected to paired- 
end 50 bp sequencing on a NovaSeq SP at 
700 million reads to achieve 29 million reads 
per sample. 

For BCR sequencing analysis, we used re- 
maining splenocytes from Bcl2 and Bcl2+Q36H 
mice at 8 months after transplantation that 
were not used for PNA enrichment and GC 
B cells sorting, had been treated with red blood 
cell lysis solution (QIAGEN 158904), and viably 
frozen in FBS with 10% DMSO. We thawed 
these splenocytes and extracted gDNA using 
Puregene Gentra cell kit (QIAGEN 158388). 
gDNA (1.5 ug) was sent to Adaptive Bio- 
technologies for ImmunoSeq assay to se- 
quence a 125-bp-long VDJ region around the 
IgH complementarity-determining region 
3 (CDR3; see fig. S16B). Briefly, the somat- 
ically rearranged IgH CDR3 was amplified 
using a two-step amplification, bias-controlled 
multiplex PCR approach (76). The first PCR 
consisted of forward and reverse amplification 
primers specific for every V and J gene seg- 
ment and amplified the hypervariable CDR3 
of the immune receptor locus. The second PCR 
added a proprietary barcode sequence and Il- 
lumina adapter sequences. 


RNA-sequencing analysis 

For all RNA-seq analyses, results were aligned 
to mm10 or hg19 using STAR (77) and annotated 
to RefSeq using the Rsubread package (78). Dif- 
ferential expression was called using edgeR 
generalized linear model with TMM normal- 
ization (79, 80) with thresholds of FC >1.5 and 
Benjamini-Hochberg corrected P < 0.01. Murine 
Bcl2 and Bcl2+Q236H tumors were addition- 
ally modeled using a batch correction for the 
respective cell sorter and, because of sex im- 
balance between conditions, only autosomal 
genes were used for analysis. 

Gene set enrichment was assessed using the 
GSEA algorithm, a computational method 
based on the Kolmogorov-Smirnov test (34). 
GSEA network analysis was performed by 
running GSEA using the Hallmark and Canon- 
ical Pathways gene sets from MSigDB. Jaccard 
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similarity index was calculated for all pairs of 
leading-edge genes from significantly enriched 
signatures (FDR < 0.05; table S3). Network 
was then generated using the R igraph pack- 
age (81), adjusting size of node for degree of 
connectivity. 

Unsupervised pathway analysis was per- 
formed using information-theoretic pathway 
analysis approach as described previously (82). 
Briefly, pathways that are informative about 
nonoverlapping gene groups were identified. 
Pathways annotations were used from the 
Biological Process annotations of the Gene 
Ontology database (http://www.geneontology. 
org), Human Protein Reference Database 
(http://www.hprd.org), MSigDB C2 (http:// 
www.gsea-msigdb.org/gsea/msigdb/human/ 
genesets.jsp?collection=C2), and signature 
categories from the Staudt Lab Signature data- 
base (83). Only human-curated annotations 
were used from the Gene Ontology database, 
and only pathways with five genes or more, 
and with 300 genes or less were evaluated. 
This pathway analysis estimates how inform- 
ative each pathway is about the target gene 
groups and applies a randomization-based 
statistical test to assess the significance of the 
highest information values. We used the de- 
fault significance threshold of P < 0.005. We 
estimated the FDR by randomizing the input 
profiles iteratively on shuffled profiles with 
identical parameters and thresholds, finding 
that the FDR was always lower than 5%. For 
each informative pathway, we determined 
the extent to which the pathway was over- 
represented in the target gene group, using 
the hypergeometric distribution, as described 
previously (82). 

Unsupervised hierarchical clustering was 
performed on RNA-seq using most variable 
genes (95th percentile for standard deviation 
across all samples). Euclidean distance was cal- 
culated using log-transformed expression values 
for GC B cell RNA-seq from CRE and Q36H 
samples [log(TPM+0.1)] and log-transformed 
FPKM expression values adjusted for sorter 
batch effect using the ComBat function of the 
R package sva (84, 85) with parametric empir- 
ical Bayes adjustment for the Bcl2 and Bcl2 
+Q36H lymphoma RNA-seq. Hierarchical clus- 
tering was then performed using Ward’s mini- 
mum variance method. 


Motif analysis 


HOCOMOCO (v11) motifs (86) were assessed 
for enrichment in gene promoter ( TSS + 2kb) 
using CentriMO tool from the MEME frame- 
work (87, 88). Comparative enrichment was 
evaluated within genes found unregulated in 
BIG1%*"' versus genes that were expressed in 
SU-DHL4 cells (FPKM > 0.1) and were non- 
differentially expressed. We performed exact 
test to evaluate the significance of the num- 
ber of best matches in the region in the pri- 
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mary set compared with the number of best 
matches in the same region in the control set 
of sequences. 


RIP sequencing analysis 


RIP sequencing results were aligned to a 
merged hg19-ERCC genome using STAR and 
annotated to RefSeq using the Rsubread pack- 
age. Read counts between samples were nor- 
malized to ERCC spike-ins using the RUVg 
method (89). Differential RIP abundance was 
called by a linear additive model using edgeR, 
adjusting for ERCC spike-ins and subtracting 
input transcript level. Transcripts found to be 
lost in BTG12?" versus BTGI”” pull-downs 
were filtered to remove any transcripts that 
were present less than twofold in BTG1”’ RIP 
versus input. 


Targeted single-cell RNA sequencing 


We sorted competing Q36H (CD45.1/2) and 
CREneg control (CD45.1) GC B cells (DAPI- 
B220*CD4 FAS*CD38°) as well as Try cells 
(DAPI-B220°CD4*CXCR5™ PD-1") from n = 
4 mice on a BD Influx sorter as described 
above. Samples were tagged, counted, and 
multiplexed into three replicates at a Q36H: 
CREneg:TFH ratio of 60:28:12. BD Rhapsody 
data were generated according to the man- 
ufacturer’s protocol. Briefly, a total of 25,000 
to 30,000 cells was loaded per BD Rhapsody 
Cartridge, into two cartridges for single-cell 
capture, followed by reverse transcription and 
exonuclease I treatment, library preparation, 
and sequencing on a NextSeq500, using 75 bp 
paired-end sequencing. The targeted panel 
of n = 496 genes (table S7) comprised BD 
Rhapsody mouse immune response panel (BD 
633753, n = 397) plus a custom set of genes 
implicated in GC or lymphoma biology (n = 
99) designed with help from BD. Fastq files 
were uploaded to Seven Bridges Genomics. 
Data were demultiplexed and sequences were 
analyzed with BD’s Rhapsody pipeline (BD 
Rhapsody Analysis Pipeline 1.4 Beta) on Seven 
Bridges (www.sevenbridges.com). This pipe- 
line included RSEC and DBEC filtration of 
UMI counts and generated a sparse matrix file 
of features by barcodes. This sparse matrix 
data were read into R using the R package 
Seurat v3.0.2 (90). Data were then scaled and 
log normalized. 

We focused on GC B cell clusters for down- 
stream analysis, on the basis of Cd19 expression 
and absence of Cd4 or Cd8 expression. Linear 
dimensional reduction was performed by calcu- 
lation of PCA from the most variable genes. 
Cells were then clustered using a resolution 
value of 0.5 and visualized by UMAP deter- 
mined from dims 1:10. Module scores were 
calculated using the AddModuleScore func- 
tion with a control value of 5. Individual genes 
and module scores were projected and used 
for appropriate classification of clusters. Single 
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cells were assigned to populations (DZ, LZ, or 
LZ-to-DZ recycling) on the basis of high score 
of gene markers and module scores per cluster. 


Single-cell trajectory analysis 


Cell cycle phase identities were assigned by 
the Seurat CellCycleScoring function. RNA tra- 
jectory analysis was performed using the R 
package Slingshot v1.2.0 (91). This package 
was used to create a “pseudotime” based on 
a combination of PCAs 1 and 2 calculated by 
Seurat, using cells identified as S phase cells 
as the anchor point. Cells that belonged to a 
pre-memory B cell cluster on the basis of Cd38 
expression were excluded, and lineage plots 
were generated by projecting pseudotime onto 
cells mapped by PCAs 1 and 2. Pseudotime 
density plots were generated using cell cycle 
phase identity and the ggplot2 v3.2.1 geom_ 
density function. The difference in density be- 
tween genotypes was determined by using the 
density function in R on the pseudotime den- 
sity plot for each genotype and subtracting the 
CREneg density Y values from the Q36H den- 
sity Y values. These differences were then 
plotted by ggplot2 geom_col function. Dif- 
ference in density distribution was tested by 
Wilcoxon rank sum test on the pseudotime 
values for each genotype. Scatterplots were 
created by plotting pseudotime on the axis 
and module scores on the y axis. Average 
score and confidence interval per genotype 
were plotted with the geom_smooth function. 


WGS driver analysis 


Whole-genome sequencing (WGS) alignments 
for 101 DLBCL and matched normal sample 
pairs were retrieved as hg19-aligned BAM files 
from the European Genome Archive (https:// 


ega-archive.org/, EGAD00001002123) and 


stripped of alignment information via Picard 
tools RevertSam, according to GATK best 
practices (https://software.broadinstitute.org/ 
gatk/best-practices) to generate unmapped 
BAM files. The unmapped BAM files sub- 
sequently underwent alignment to hg38 and 
somatic variant calling for SNVs, indels, and 
structural variants (SVs) via the Sarek pipe- 
line v2.5.1 (https://github.com/nf-core/sarek). 
Briefly, the Sarek pipeline (92) applies BWA 
mem alignment, duplicate removal, base re- 
calibration, somatic SNV/indel calling via 
Strelka2 (93), somatic SV calling via Manta 
(94), and variant annotation with VEP (95). In 
summary, this pipeline yielded VCF files con- 
taining 1.36 million SNVs across 101 DLBCL 
cases, which were used for downstream analyses. 

FishHook (https://github.com/mskilab/ 
fishHook) was used to model background 
mutational processes and nominate DLBCL 
mutational hotspots across the cohort of 101 
DLBCL cases (see above). FishHook takes as 
input mutation calls (e.g., VCF), a set of hy- 
pothesis intervals (.bed files), an eligible territory 
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(bed file), and a set of genomic covariates (.bed, 
-bw files) and outputs a model of mutation 
density and hypothesis intervals associated 
with a P value and enrichment score. Briefly, 
fishHook (96) models genome-wide somatic 
mutation density in tumor genomes across 
the intersection of an eligible territory (e.g., 
coding sequences, mappable genomic regions) 
and set of hypothesis intervals (e.g., genes) 
as a function of genomic covariates, which 
can represent sequence context composition, 
chromatin features, or the fractional overlap 
with reference genomic annotations. The max- 
imum likelihood fit of a fishHook model, im- 
plemented as a gamma-Poisson regression, 
assigns weights to covariates and an expected 
mutation density to each hypothesis interval. 
An enrichment value is computed at each 
hypothesis interval as the ratio of observed 
to expected mutation density. The model is 
further used to define a cumulative distribu- 
tion function (CDF) for mutation density at 
each interval. Each interval is then assigned a 
one-sided P value as the probability that the 
mutation density is greater or equal than the 
observed density. 

FishHook was applied to analyze the dis- 
tribution of 1.36M DLBCL SNVs across 24,498 
protein coding genes (GRCh38) using coding 
sequences as the eligible territory a set of ge- 
nomic covariates representing B cell-specific 
transcriptional and chromatin state. To gen- 
erate the set of covariates, we annotated .bed 
files of protein coding genes with the number 
of ATAC-seq peaks from human purified GC 
B cells (unpublished data) within 100 kb, the 
number of GC B cell H3K27ac ChIP-seq de- 
fined super-enhancers within 100 kb, and 
the number of TSS within 100 kb for genes 
expressed >1 TPM in human GC B cells (RNA- 
seq). Super-enhancers were called from previ- 
ously reported H3K27ac ChIP-seq data from 
human purified GC B cells (97) using the ROSE 
method (98).The resulting fishHook model 
with three covariates was fit to annotate 
genes with P values and enrichment scores 
(table S1). We labeled genes with (Benjamini- 
Hochberg) FDR < 0.01 as significant. We gen- 
erated Q-Q plots by pairing observed —log10 
transformed quantiles of observed P values 
(y axis) with their corresponding —log10 trans- 
formed quantiles from the uniform distribu- 
tion (w axis). A genomic inflation factor 1 was 
computed as the slope of a least-squares re- 
gression line fitting these data while intercepting 
the origin. The value of 4 (<1.05) was consist- 
ent with minimal statistical inflation (99). 


BCR sequencing analysis 


Initial immunoSeq BCR sequencing data pro- 
cessing was performed using the immunoSeq 
Analyzer 3.0 (Adaptive Biotechnologies). Only 
sequences associated with at least five tem- 
plates were retained for further analysis. BCR 
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repertoire sequencing data was then analyzed 
using the Immcantation (www.immceantation. 
org) framework. VDJ genes for each sequence 
were aligned to the IMGT GENE-DB database 
accessed 22 June 2021 (100) using IgBlast 
v1.17.1 (01) and Change-O v1.1.0 (102). Only 
IMGT V-gene sequences obtained from C57BL/ 
6 mice were included in the reference data- 
base. Nonproductive BCRs were removed. Se- 
quences were grouped into clonal clusters, 
which contain B cells that relate to each other 
by somatic hypermutations from a common 
VDJ ancestor. Sequences were first grouped 
by common IGHV gene annotations, IGHJ 
gene annotations, and junction lengths. Using 
the hierarchicalClones function of scoper v1.2.0 
(103), sequences within these groups differing 
by a length normalized Hamming distance of 
0.1 within the CDR3 region were defined as 
clones using single-linkage hierarchical clus- 
tering (104). This threshold was determined 
through manual inspection of distance to 
nearest neighbor plots using shazam v1.1.0 
(105). Within each clone, germline sequences 
were reconstructed with D segment and N/P 
regions masked (replaced with “N” nucleo- 
tides) using the createGermlines function 
within dowser (106). 

Somatic hypermutation was calculated as the 
Hamming distance between each sequence 
and its predicted unmutated germline ances- 
tor. Because immunoSEQ data contains only 
the partial V-gene sequence, only between 69 
and 108 nucleotides of region FWR3 were 
used in this calculation. To quantify B cell 
clonal diversity, we calculated Simpson’s diver- 
sity for each sample using the alphaDiversity 
function of alakazam v1.2.0 (102). Lower values 
of Simpson’s diversity indicate a greater 
probability of two random sequences belong- 
ing to the same clone, consistent with more 
large clones. To account for differences in se- 
quence depth, samples within each compari- 
son were down sampled to the same number 
of sequences, and the mean of 1000 such re- 
sampling repetitions was reported. Samples 
with fewer than 30 sequences were excluded. 
Clonal lineage trees were built using dowser 
v1.0.0 (106) and IgPhyML v1.1.4 (50). Because 
predicted germline junction sequences are 
masked, only V-gene sites were used to build 
lineage trees and estimate substitution model 
parameters. Codons split by insertions relative 
to the IMGT germline reference sequences 
were masked using the maskSequences func- 
tion in dowser. Lineage tree topologies, branch 
lengths, and subject-wide substitution model 
parameters using maximum likelihood under 
the GY94 model (107). Using fixed tree to- 
pologies estimated from the GY94 model, we 
then estimated branch lengths and subject- 
wide parameter values under the HLP19 model 
(50). Only clones containing at least two unique 
sequences and a total >100 sequence reads 
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were included. Trees were visualized using 
dowser and ggtree v3.3.1.901 (108). 


In silico GC modeling 


An agent-based GC model with molecular 
events of GC B cell selection based on DisseD 
theory (47) was used to perform in silico simu- 
lations. The model uses elements described 
before (109-111) and differential equations de- 
scribing evolution of the signaling molecules 
mTOR, Myc, and FoxO. A three-dimensional 
lattice with chemokine gradients is used to 
represent the GC area and incorporates ran- 
domly distributed Ty, cells and follicular den- 
dritic cells. Founder GC B cells enter the GC 
area with a fixed rate for an initial time period, 
undergo six divisions, and eventually acquire a 
LZ phenotype. LZ B cells interact with follicu- 
lar dendritic cells, resulting in the acquisition 
of antigen with an affinity-based probability. A 
four-dimensional shape space (772) is used to 
represent the B cell affinity, where the position 
of a B cell with respect to a predefined optimal 
position denoting the antigen corresponds to 
the affinity of the cell. Successful antigen col- 
lection is followed by interactions with Try 
cells. When multiple B cells are bound to the 
same Try cell, the Try cell polarizes toward 
and signals the B cell with the highest amount 
of antigen collected. Acquisition of antigen 
and interaction with Ty cell induces changes 
in the levels of mTOR, Myc, and FoxO mol- 
ecules. Time period of interaction with follicu- 
lar dendritic cells and Ty cells ends, and a 
fate decision is initiated, either at the time 
when mTOR reaches a threshold value or when 
the time exceeds a maximum time period of 
18 hours. GC B cells with Myc and mTOR 
levels reaching predefined threshold values 
are selected. Selected cells return to the DZ 
phenotype and divide a certain number of 
times, as determined by Myc levels at the time 
of selection by Ty cells. Detailed description 
of the GC model and parameter values are 
provided in the supplementary text of (47). 

Two models were considered to recapture 
the behavior of BTG1 mutant GC B cells. In 
the first model, we considered a faster up- 
regulation of mTOR and Myc: The rate of 
mTOR and Myc up-regulation in Btg1 mutant 
GC B cells upon contact with TFH cells was 
increased 1.12 times as compared with WT 
cells. In the second model, we considered a 
faster up-regulation of mTOR and Myc, coupled 
with a shorter S phase duration: In addition to 
the 1.12 times faster up-regulation of mTOR 
and Myc, Btgl mutant GC B cells were as- 
sumed to undergo accelerated S phase pro- 
gression. Average S phase duration of mutant 
GC B cells was shortened by 21% as compared 
with WT cells. 

Each of the two models were tested with 
different simulation setups, corresponding to 
the competitive and noncompetitive in vivo 
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experiments. In the competitive setup, found- 
er GC B cells at the start of the simulation 
were assigned as either Btgl] mutant or WT 
with a probability of 0.5, for equal starting 
proportions of mutant and WT GC B cells. 
Positions of GC B cells in the shape space 
were chosen at a distance of 2 from the opti- 
mal position, to mimic the high affinity of 
B1-8" cells. For the noncompetitive setup, 
separate simulations were performed with 
100% cells being WT or Btgl mutant GC 
B cells, and affinities were randomly chosen. 
Each simulation was repeated 300 times, and 
the mean and SD values were calculated. To 
quantify the relative difference between mu- 
tant and WT, mutant readouts were normal- 
ized with corresponding WT readouts. In vivo 
experimental data were normalized in a sim- 
ilar way for comparison to in silico simula- 
tion results. Simulations were performed with 
C++, and statistical tests were performed 
using R. 


Survival analyses on human ABC-DLBCL cases 


Kaplan-Meier estimator was used to estimate 
the overall survival probability followed by 
log-rank test to compare the difference be- 
tween BTGI mutational status. Univariable 
and multivariable Cox regression were used to 
assess the overall survival difference between 
BTGI1 mutational status, adjusting for IPI, 
LymphGen call, and sex. All analyses were 
performed in statistical software SAS v9.4 
(SAS Institute). 


Statistics 


For all statistical analyses, sample size (7) for 
each experimental group, the nature of repli- 
cates, and details for statistical tests used were 
specified either in the figure legend, main text, 
or the methods section. 
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Heritable defects in telomere and mitotic function 
selectively predispose to sarcomas 
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Cancer genetics has to date focused on epithelial malignancies, identifying multiple histotype-specific 
pathways underlying cancer susceptibility. Sarcomas are rare malignancies predominantly derived from 
embryonic mesoderm. To identify pathways specific to mesenchymal cancers, we performed whole- 
genome germline sequencing on 1644 sporadic cases and 3205 matched healthy elderly controls. 
Using an extreme phenotype design, a combined rare-variant burden and ontologic analysis 
identified two sarcoma-specific pathways involved in mitotic and telomere functions. Variants in 
centrosome genes are linked to malignant peripheral nerve sheath and gastrointestinal stromal 
tumors, whereas heritable defects in the shelterin complex link susceptibility to sarcoma, melanoma, 
and thyroid cancers. These studies indicate a specific role for heritable defects in mitotic and 


telomere biology in risk of sarcomas. 


ommon and rare genetic variation cur- 
rently explains less than 50% of the fa- 
milial relative risk for cancer, leaving the 
majority of heritability unexplained (J, 2). 
Nonetheless, genetic studies focused on 
common epithelial cancers have yielded major 
insights into the biological mechanisms under- 
pinning specific cancer susceptibilities, exem- 


plified by homologous recombination in breast 
cancer and mismatch repair in colorectal can- 
cer (3, 4). Studies of different cancer popula- 
tions may therefore yield further insights into 
cancer biology. Differing fundamentally from 
epithelial cancers, sarcomas are rare connec- 
tive tissue malignancies that arise predominant- 


ly from embryonic mesoderm and affect a 


younger population (5). Because of their rarity, 
they have been relatively understudied to date 
at the population level. For the most part, gen- 
etic studies into sarcomas have used either 
familial linkage approaches or genome-wide 
association studies (GWAS). Studies of rare 
sarcoma-associated syndromes have led to dis- 
covery of key cancer genes, such as the canoni- 
cal tumor suppressor, TP53 (6). Whole-exome 
sequencing (WES) or whole-genome sequenc- 
ing (WGS) are now being used to catalog rare 
variants in known genes (7, 8). In principle, 
combining both population- and family-based 
WGS approaches could uncover additional 
genes and pathways by integrating statistical 
methods with clinical information. In this 
study, we undertook a comprehensive, popu- 
lation-based, case-control study using WGS to 
identify penetrant genes and pathways that 
may explain sarcoma risk. 


Results 
Clinical findings 


In total, 1644 sarcoma probands were recruited 
from sarcoma clinics in this international 
multi-institutional study, regardless of family 
history (tables S1 and S2). Subjects had a me- 
dian age at first cancer diagnosis of 47 years, 
and 49 years at first sarcoma diagnosis. Soft- 
tissue sarcomas constituted 78.2% of diagnoses 
(Table 1 and table S3). Multiple primary cancers 
were common, including breast (n = 77), mela- 
noma (n = 37), second connective tissue tumors 
(n = 37), nonmelanoma skin (7 = 35), prostate 
(n = 23), colorectal (2 = 21), and thyroid can- 
cers (7 = 17). Because radiation is a known risk 
factor for sarcomas (9), we determined the rela- 
tionship with therapy for a prior cancer. In 110 
(38%) of the 293 individuals with multiple pri- 
mary cancers, the sarcoma was the first tumor 
diagnosed. In 183 individuals where sarcoma 
was the second tumor diagnosed, less than 
one-third (56) occurred within a prior radiation 
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Fig. 1. A systematic analysis of genes and pathways implicated in sarcoma. 
(A) A schematic of the analytic approach of progressively enriching for genes 
and pathways carrying an excess burden of pathogenic variation in sarcoma 
probands compared to a universal contro! cohort (MGRB), including correction 
for age-depletion of nonsarcoma-related genes. Graph representations of the 
968 genes (nodes) in the secondary gene set and their interactions (edges). The 
analysis was performed in Cytoscape. Node color represents the P-value of 
WRVBT enrichment for each gene in the sarcoma probands relative to the MGRB, 
the size of the node represents the excess weighted burden of pathogenic 


border represents protein interaction and ontologic enrichment by a combined 
rankscore. Specific groups (1 to 3) relevant to subsequent analyses are 
highlighted (pale blue boxes). (B to F) Specific clusters are as follows: (B) Graph 
representation of the super cluster of 187 genes (nodes) in the secondary 
geneset centered on 7P53 and their interactions (edges); (C) A 22-gene 
cluster representing ontologies linked to RNA processing and activity in the 

M phase of the cell cycle. (D) Eleven genes implicated in antigen processing, 
ubiquitination, and neddylation. (E) Seven genes defined by centrosomes, spindle 
organization, and the G2/M phase of the cell cycle. (F) Five-gene clique with 


variants in the sarcoma probands relative to MGRB, and the width of the node 


field. Of 60 women who developed a sarcoma 
after breast cancer, in more than half (32), the 
sarcoma arose outside the thorax. These data 
suggest that the majority of sarcomas in indi- 
viduals with multiple primary tumors are not 
therapy related. Upon expert review of pedi- 
grees, more than 20% of families met formal 
criteria for known hereditary syndromes, in- 
cluding Li Fraumeni Syndrome (LFS), hered- 
itary breast and ovarian cancer (HBOC), and 
familial melanoma, or were considered to have 
unusual cancer patterns (Table 1) (10). Collec- 
tively, these data suggest a strong genetic basis 
for apparently sporadic sarcomas. 


Spectrum of mutated cancer genes in the 
sarcoma population 


After undertaking WGS, we identified 37,820 
rare single-nucleotide variants and insertions 
or deletions (SNV/indels), of which 1033 were 
known pathogenic or likely pathogenic (C5), 
10,702 were new loss-of-function or protein 
truncating (C4), and 26,085 were possibly 
pathogenic (C3) (fig. S1). One hundred and 
nine sarcoma probands (6.6%) carried 112 C4 
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a role in mitochondrial tra 


or C5 variants in clinically relevant cancer 
genes based on an extended list from the 
American College of Medical Genetics (ACMG) 
(table S4) (77), including 13 with variants in 
ATM and TP53, 10 NFI, 9 BRCA2, 5 CHEK2, 
4 MSH6, 4 PALB2, 4 LZTR1, 3 each in BRIPI 
and EXT2, 2 each in APC, BRCAI, CDKN2B, 
ERCC2 (both compound heterozygotes), MSH2, 
RHBDF2, SDHA, SDHB, and WT1, and 1 each in 
AIP, BAPI, BMPRIA, BUBIB, DICERI, EXT], FH, 
FLCN, HNFIA, MLH1, MUTYH (homozygote), 
PMS2, POLD1, POLE, PTEN, PTCH1, PTPN11, 
RADSIC, RAD5ID, RBI, SDHC, and SMARCB1 
(data file S1). One individual carried C4 or C5 
variants in both SDHB and MSH6. Carriers of 
C4 or C5 variants were younger at first cancer 
diagnosis (median age of 41 years) than the re- 
mainder of the cohort (median age of 48 years; 
P = 0.0003) and were more likely to develop 
multiple primary cancers {29 of 101 probands 
with sufficient clinical information compared 
with 264 of 1478; odds ratio of 1.85 [95% con- 
fidence interval (CI) 1.18-2.91]; P = 0.011}. Fewer 
than one in six of these probands had expected 


nslation and ribosomes. 


types (12). Although 10 out of 13 (77%) carriers 
of C4 or C5 TP53 SNV/indels met classic or 
Chompret criteria for LFS, only 10 out of 164 
(6.1%) families meeting either LFS criteria 
carried a C4 or C5 TP53 SNV/indel. Other genes 
in which C4 or C5 variants were detected in 
families meeting LFS criteria were MSH6 (2), 
ERCC2 (2), BUBIB, LZTR1, CHEK2, HNFIA, 
PTEN, PMS2, BRCA2, MSH2, and SDHB. Only 
1 of 15 (6.7%) probands with C4 or C5 SNV/ 
indels in BRCAI, BRCA2, or PALB2 met criteria 
for HBOC (13), whereas only 1 of 27 (4%) fam- 
ilies with clinical features of HBOC carried a 
cognate C4 or C5 variant (PALB2). Figure $2 
illustrates a sarcoma family carrying a patho- 
genic BRCAI variant, which does not meet 
HBOC criteria. Twelve sarcoma families met 
criteria for both HBOC and classic or Chompret 
criteria for LFS, none of which were explained 
by C4 or C5 variants in BRCAI, BRCA2, PALB2, 
or TP53. None of 9 probands with pathogen- 
ic SNV/indels in mismatch repair genes met 
Amsterdam criteria for colorectal cancer (14), 
and none of 15 probands meeting familial 


clinical phenotypes that were based on geno- 


melanoma criteria (15) carried pathogenic 
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SNV/indels in the gene most strongly asso- 
ciated with familial melanoma, CDKN2A (16). 
By contrast with SNV/indels, structural var- 
iants appeared to make little contribution to 
pathogenic burden. Potentially germline path- 
ogenic rare structural variants were identified 
in BRCA2 (proband with two breast cancers 
and an undifferentiated pleomorphic sarcoma), 
SDHC [gastrointestinal stromal tumor (GIST)], 
and MSH6 (proband family met Amsterdam 
criteria for Lynch syndrome). All genes and 
clinical correlates are shown in data file S1. 


Gene and pathway discovery analyses 


To maximize power, we applied an extreme phe- 
notype case: control design using 3205 healthy 
elderly controls [from the Medical Genome Ref- 
erence Bank (MGRB)], which we previously 
showed are depleted in cancer-associated ge- 
netic variation relative to population-based 
controls (17). A weighted rare-variant burden 
test (WRVBT) compared the relative burden 
of C3 to C5 variants in sarcoma probands 
(discovery set) to the MGRB, after controlling 
for gender, relatedness, and population struc- 
ture (Fig. 1A) (78). Sensitivity analyses were 
performed to control for critical parameters 
including population stratification (18). A 
relaxed P value of < 0.1 was applied to generate 
an inclusive primary gene set of 1176 genes 
(Fig. 1A). To control for noncancer, age-related 
genotypes depleted in MGRB (for example, 
cardiovascular disease or dementia), we used 
the Australian Schizophrenia Research Bank 
(ASRB) (17) as a second control set with a 
median age of 39 years. Genes also enriched 
in ASRB compared to MGRB were excluded 
as age- rather than sarcoma-related. A sec- 
ondary gene set of the top-ranked 968 genes 
by pathogenic burden was subjected to protein 
interaction- and pathway-based analysis. 
These analyses revealed gene clusters that 
were composed of highly interrelated subclus- 
ters, which we termed cliques. Of 277 (28.6%) 
genes with at least one first-degree neighbor, 
224 were involved in clusters of five or more 
genes. A supercluster of 187 genes centered on 
TP53 (Fig. 1B), the strongest known sarcoma 
risk gene. 7P53 not only had the greatest bur- 
den of C3 to C5 variants but also had the 
highest number of first-degree neighbors 
(n = 27). The second largest cluster (Fig. 1C) 
contained 22 genes, representing ontologies 
linked to RNA processing and M-phase of the 
cell cycle, kinetochores, and chromatid sep- 
aration. A third cluster (Fig. 1D) comprised a 
clique of 11 genes implicated in antigen pro- 
cessing, ubiquitination, and neddylation. A 
fourth seven-gene clique (Fig. 1E) was defined 
by centrosomes, spindle organization, and the 
G»/M phase of the cell cycle, and a five-gene 
clique (Fig. 1F) reflected a role in mitochon- 
drial translation and ribosomes. The degree 
of interconnectedness within the secondary 
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Fig. 2. Sarcoma-specific enrichment in rare pathogenic variants in the shelterin and centrosome 
pathways. Relative enrichment of pathogenic variants (Log2 Odds Ratio) in sarcoma probands, a subset of 
157 cases with GIST or MPNST, and a nonsarcoma cancer population of 632 individuals with epithelial 
cancers. Gene sets include TP53 alone (TP53); BRCAI, BRCA2, and PALB2 (HBOC); POT1, TINF2, TERF1, 
TERF2, TERF2IP, STAG3, SMARCALI, and TIMELESS (shelterin); NF1, SDHA, SDHB, SDHC, SDHD, and LZTR1 
(NF1); and CEP63, CEP72, HAUS4, HAUS5, MZT1, and SSNAI (centrosome). Circle size reflects the odds 
ratio, and the color represents the P value of the enrichment in each group relative to MGRB. Pairwise 
comparisons between groups are shown (*P < 0.05; **P < 0.01; ****P < 0.0001). 


gene set suggests enrichment in distinct func- 
tional pathways. 

To further enrich for the most important 
pathways, we focused on the highest-ranked 
85 genes with a P value < 0.05 by WRVBT and 
protein interactions (tertiary gene set) (Fig. 1A). 
Consistent with known roles in sarcoma sus- 
ceptibility, the tertiary gene set included TP53, 
NFI, EXT], and EXT2 [UniProt Keywords KW- 
0043; false discovery rate (FDR) = 2.4 x 107°], 
as well as PTCH1 (Table 2). Three ontological 
groups of interest are highlighted in blue (Fig. 
1, B and E). Ontologic group 1 comprised the 
shelterin complex, represented by three genes: 
POTI, TERF1, and TINF2 (GO Component 
GO.0070187; FDR = 2.8 x 10°) (Table 2). 
Group 2 comprised the mitotic spindle, includ- 


ing the centrosomal genes CEP63, HAUS4, 
and HAUS5 (GO Component term GO.0005819; 
FDR = 4.5 x 107°) (Table 2). Group 3 includes 
EXT1 and EXT2, linked to hereditary exostoses 
and bone sarcomas (UniProt Keywords KW- 
0361; FDR = 2.3 x 10~*) (Table 2). 

We next determined whether these path- 
ways were specific for sarcomas by compar- 
ing sarcoma probands to a population with 
predominantly epithelial cancers (7 = 632) 
(table S5). As a positive control, an enrich- 
ment in pathogenic burden was seen for TP53 
in sarcoma probands compared with the epi- 
thelial cases. As a negative control, the reverse 
was observed with a HBOC gene set (BRCAI, 
BRCA2, and PALB2). Sarcoma-specific en- 
richment was observed in both the shelterin 
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Fig. 3. Replication of enrichment in A 
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datasets for shelterin, centrosome, and 
sarcoma gene sets are 5.58 [3.11, 
9.64], 4.74 [1.90, 11.83], and 10.59 
(4.36, 25.72], respectively. The 
corresponding |? values are 0% for all 
the gene sets. (B) This analysis 

used the Hartwig Foundation whole- 
genome germline dataset of 

4178 cancers, including sarcomas 
(276, including 71 GIST or MPNST), 
breast cancer (801), bowel cancer 
(650), lung cancer (583), prostate 
cancer (412), kidney and urothelial 
cancers (312), melanoma (300), 
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esophagastric cancer (193), ovarian and fallopian tube cancers (174), pancreatic cancer (149), hepatobiliary cancer (138), brain cancer (76), uterine adenocarcinoma 
(73), and mesothelioma (41). Gene sets were defined as for Fig. 2. Enrichment for each cancer class was determined by normalizing the frequency of variants in 


that class to the overall frequency in the entire dataset. These data were subject to unsupervised hierarchica 


and the centrosome gene sets (Fig. 2). For 
the centrosome set, further enrichment was 
specifically observed in malignant peripheral 
nerve sheath tumors (MPNSTs) and GISTs, 
which share neural origins and NF/ as a sus- 
ceptibility gene (5, 19). The degree of enrich- 
ment in centrosome genes in MPNSTs or GISTs 
was comparable to that seen for NFI and 
related genes such as LZTR1, SDHA, SDHB, 
SDHC, and SDHD. A follow-up validation 
analysis of the discovery set with an itera- 
tive (1000-fold) resampling method also re- 
capitulated the enrichment of rare variant 
burden in the shelterin and centrosome gene 
sets (fig. S3). 
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Independent replication of pathway enrichment 

We next sought to validate these findings and 
to exclude population stratification. First, we 
compiled an independent WGS and WES repli- 
cation set (n = 839 sarcoma cases from the US, 
Netherlands, and Norway) and geographically 
matched independent control set (nm = 4094 
cancer-free controls) (Fig. 3A and table S2). 
A WRVBT was performed using C4 and C5 
variants in the shelterin, centrosome, and 
sarcoma gene sets, for each geographically 
matched case-control discovery and replication 
set, and after combining these for statistical 
power (Fig. 3A). Enrichment was confirmed in 
pathogenic variants in the sarcoma, shelterin, 


clustering for both cancer class and gene sets. 


and centrosome gene sets within the combined 
geographically matched French and Australian 
discovery sets, as well as in the combined geo- 
graphically matched replication sets from the 
Netherlands, US, and Norway, with no signif- 
icant heterogeneity across these populations 
(Fig. 3A). Residual population stratification 
was excluded as a cause for signal enrichment 
using principal components ancestral match- 
ing within the geographically matched popula- 
tions (figs. S4 and S5) (78). We next compared 
the relative burden of pathogenic variants in 
the sarcoma, shelterin, centrosome, and HBOC 
gene sets across a wide range of cancer types 
using the Hartwig Foundation dataset (table S6) 
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(20). We observed enrichment of HBOC genes 
in breast, pancreatic, ovarian, and fallopian 
tube cancers, but not in sarcomas (Fig. 3B). 
No enrichment was observed in nonsarcoma 
cancers for the sarcoma, shelterin, and centro- 
some gene sets. By contrast, a distinct relative 
enrichment in the shelterin and centrosome 
gene sets was again observed in the sarcoma 
cases, and specifically in GISTs or MPNSTs. 


Clinical, familial, and molecular 
genotype—phenotype patterns 


Of 19 individuals carrying C4, C5, or essential 
splice-site variants in the centrosome core 
genes CEP63, CEP72, HAUS4, and HAUSS, as 
well as in the related genes CEP89, SSNA1, 
and PCM1 (table S7), 8 (42%) developed GIST 
(7) or MPNST (1). Notably, one individual 
carried variants in both HAUS4 and HAUS5. 
An additional 20 individuals carried one C3 
variant each in CEP57, CEP63, CEP89, HAUS4, 
HAUS5, PCM1, SSNAI, and MZT1. Consistent 
with the observations in individuals with C4 
or C5 variants, 6 of these 20 individuals (30%) 
with C3 variants developed either GIST (3) or 
MPNST (3). Altogether, 14 of 39 (86%) carriers 
of C3 to C5 variants in centrosome genes de- 
veloped MPNST or GIST, compared with 143 
of 1462 (9.8%) in the remaining cohort (RR 
4.029 [95% CI 2.372-6.142]; P < 0.0001). Of five 
carriers with GIST, four had somatic c-KIT 
variants, and one was wild-type. The median 
age at first cancer diagnosis in C4 or C5 variant 
carriers (43 years) was comparable to that of 
the cohort as a whole. Two of the centrosome 
pedigrees met clinical criteria for LFS, and 
another two were deemed clinically suspi- 
cious. Among individuals with MPNST or GIST, 
centrosome variants were mutually exclusive 
with C4 or C5 variants in genes known to 
cause these tumors: NFI (9), LZTR1, SDHA, 
and SDHB. An analysis of 4179 cancers, in- 
cluding 277 sarcomas, found that 2 of 24 carriers 
of germline pathogenic variants in centrosome 
genes showed somatic loss-of-heterozygosity, 
comparable to 5 of 26 cancers in individuals 
with pathogenic germline variants in NF1, 
SDHA, SDHB, and SDHD. These data sug- 
gest that defects in centrosome pathways 
appear to confer increased risk of MPNST 
or GIST. 

There were 13 carriers of C4 or C5 variants in 
four of the six canonical genes in the shelterin 
complex (Table 3), POTI (6), TERFI (2), TINF2 
(3), TERF2IP (2), and an additional 12 carriers 
of C4 or C5 variants in genes selected on the 
basis of their roles in telomere biology, TIME- 
LESS (3), STAG3 (3), and SMARCAL] (6) (21, 22). 
POTI, the only established hereditary can- 
cer gene in this group, has been most strongly 
linked to melanoma, but also has been re- 
ported in thyroid cancer and sarcoma (23). 
A loss-of-heterozygosity analysis performed 
as described above identified 9 somatic loss-of- 
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Fig. 4. Leukocyte telomere length in carriers of shelterin-complex variants. (A) Relative telomere 
lengths (RTL) were derived from telomere analysis (TelSeq) of whole-genome sequences on peripheral blood 
DNA for the sarcoma probands. The x axis indicates the age for each individual in years, and the y axis 
indicates RTL in arbitrary units. (B) Leukocyte RTL is longer in carriers of variants in the shelterin 
complex. Left panel: RTL in probands with shelterin complex C3 to C5 (n = 35) or C4 or C5 variants (n = 25) 
compared to the remainder of the sarcoma probands. Right panel: Age distribution for these groups. 

(C) Representative pedigrees showing autosomal dominant cancer patterns with an excess of melanoma. 
Top left panel: family with a POT1 pathogenic variant; top right panel: family with a TINF2 pathogenic variant; 
bottom panels: representative pedigrees of probands with long telomeres and excess of melanoma with 


no pathogenic variants identified. 


function events in 55 tumors from carriers 
of pathogenic germline variants in the shel- 
terin genes, including 2 of 8 in POT7 and 5 of 
19 in SMARCALI. 

Amongst carriers of shelterin C4 or C5 var- 
iants, formal review of pedigree patterns iden- 
tified four families meeting GenoMEL criteria 


for familial melanoma, and four families that 
met Chompret or classic LFS criteria. Two 
additional families meeting GenoMEL crite- 
ria carried C3 variants in POTI (G176R and 
D224N; fig. S6) (24). Notably, three individ- 
uals also had thyroid cancer. Several lines 
of evidence suggest the association between 
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Table 1. Clinical and pedigree characteristics of sarcoma probands. HNPCC, Hereditary 
nonpolyposis colorectal cancer; HBOC, Hereditary breast or ovarian cancer. 


Characteristic 


Gender 


N (%) 


Female 855 (52%) 


First cancer 


Classic or Chompret LFS 164 


Familial melanoma 15 


Clinically suspicious* 


Uninformative 


*Clinically suspicious: >0.5 cancers or FDR with =3 FDR reported and =1 FDR <50 years at diagnosis; three or more cancers 
per proband at any age; 1 or more sarcomas in an FDR; >1 sarcoma or connective tissue tumor at any age per 


proband; two cancers under 50 years of age per proband. 


*Neurofibromatosis type 1 (5), retinoblastoma (2), multiple 


endocrine neoplasia type 1 (2), familial papillary thyroid (1), familial paraganglioma (1), schwannomatosis (1), Gorlin syndrome (1), 


and McCune Albright syndrome (1). 


sarcoma, melanoma, and thyroid cancers 
is not therapy related. Both melanoma and 
thyroid cancer are curatively treated by sur- 
gery without genotoxic therapy; moreover, 
relatives of shelterin probands carrying C4 
or C5 variants also had an increased inci- 
dence of cancer {standardized incidence ratio 
(SIR) 2.06 [1.50-2.82]; P = 1.84 x 1074}, com- 
parable to that of 7P53 {SIR 2.59 [1.53-4.37]; 
P = 3.69 x 10°“*3. The risk of melanoma was 
markedly increased in families carrying C4 or 
C5 variants in the shelterin complex {SIR 
5.60 [3.25-9.65]; P = 5.61 x 10 1°}, as was the 
risk of thyroid cancer {SIR 19.74 [8.22- 
47.43]; P = 2.56 x 10°". An additional in- 
dividual carried a C4 or C5 variant in CTCI 
and another individual carried a C4 or C5 
variant in NHP2—, genes that are both im- 
plicated in telomere maintenance through 
the CST and telomerase complexes (25, 26). 
Taken together, these data suggest that heri- 
table defects in telomere function predispose 
to a syndrome of sarcoma, melanoma, and 
thyroid cancers. 

We next analyzed relative leukocyte telomere 
lengths (RLTL) derived from WGS. As expected, 
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RLTL progressively shortened over life span 
in sarcoma probands (Fig. 4A), unrelated to 
chemotherapy or radiotherapy exposure (J8). 
Despite a similar age at first cancer diagnosis, 
shelterin variant carriers had longer RLTL 
than the remainder of the cohort (Fig. 4B). No 
carriers had clinical or genomic evidence of 
clonal hemopoiesis, dysplasia, or leukemia (17). 
Independent of genotype, relatives of probands 
with long telomeres (>2 SD above the mean 
for age) had an increased risk of melanoma 
{SIR 3.58 [2.19-5.84]; P = 3.36 x 10-7} and thy- 
roid cancer {SIR 8.43 [3.16-22.45]; P = 5.96 x 
10°}. Representative melanoma-associated 
pedigree patterns associated with POTI, TINF2, 
and SMARCALI variants are shown in Fig. 4C 
and fig. S6. For two families of probands with 
long RLTL and cognate tumor patterns in 
which no causal coding variants were found 
in shelterin complex genes (Fig. 4C, lower 
panels), we did not find evidence for causal 
noncoding variants. 


Discussion 


Together, these data suggest that heritable de- 
fects in telomere and mitotic function increase 


the risk of sarcoma, in contrast to most epithe- 
lial cancers. Telomere maintenance and mito- 
sis are fundamental to chromosome integrity. 
Sarcomas are predominantly genomically un- 
stable (27) and use the alternate mechanism 
of telomere maintenance (ALT) (28). Through 
replication stress, ALT generates genomic in- 
stability that signals to TP53, which is strongly 
linked to sarcoma susceptibility (29-33). Fifty- 
two (3.2%) sarcoma probands carried C3 to 
C5 variants in the shelterin complex and rel- 
ated telomere genes, compared to 13 for TP53 
(0.8%). Pathogenic variation in these genes 
is rare in the healthy individuals, with the 
MGRB containing only one C3 to C5 variant in 
TP53 (0.03%), and six in shelterin complex 
genes (0.2%). 

Shelterin is a six-subunit protein complex 
which protects the ends of telomeres (34) and 
is associated with melanoma risk (35, 36). 
Shelterin variants occurred in 6 out of 15 (40%) 
sarcoma families with familial melanoma, in 
contrast to the gene most commonly mutated 
in familial melanoma (CDKN24A), in which no 
variants were found (16). Like 7P53, germline 
shelterin pathogenic variants exhibit a dom- 
inant pattern of inheritance and cancer risk. 
Unlike 7P53, cancer onset occurs at an older 
age, perhaps the result of an interaction with 
age-dependent telomere shortening. In addition 
to shelterin, three genes with roles in telomere 
maintenance were identified with multiple 
C4 or C5 variants: SMVARCALI, TIMELESS, and 
STAG3. SMARCALI and TIMELESS regulate 
telomere stability in ALT cells (2/7, 22, 37-40), 
and STAG3 plays roles in both centromere and 
telomere-sister chromatid cohesion (41, 42). 
Shelterin complex pathogenic variant carriers 
had long leukocyte telomeres, recently asso- 
ciated with increased risk for multiple cancers 
(32). Sarcomas, melanomas, and thyroid cancers 
may share an association with long-telomere 
syndrome (36). Notably, we did not observe 
any variants in components of the telomerase 
complex, the dominant mechanism of telomere 
maintenance in epithelial cancers. 

Overall, 2.3% of sarcoma probands (compris- 
ing 10% of MPNSTs and 8.4% of GISTs) carry 
C3 to C5 variants in centrosome genes. Centro- 
some variants were mutually exclusive with 
variants in known MPNST or GIST genes (VF7, 
LZTR1, SDHA, and SDHB), which themselves 
account for 15% of MPNSTs and 5% of GISTs 
in our study. Chromosome segregation during 
mitosis begins with centrosome assembly (43). 
Although somatic centrosome abnormalities are 
common in cancer (44), a germline role for this 
pathway is limited to CEP57. CEP57 has been 
associated with mosaic variegated aneuploidy 
(45), a recessive condition associated with chro- 
mosomal instability and increased risk of can- 
cer, including sarcomas (46). CEP63 interacts 
with CEP72 and CEP152 to mediate formation of 
the centrioles, centrosomes, and bipolar spindle 
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Table 2. Biological pathways enriched in pathogenic variants in the top-ranked 85 sarcoma-specific genes by WRVBT. The top 85 genes by WRVBT 
and protein interactions with at least one other gene were analyzed in Cytoscape using StringApp. Ontologic groups refer to relevant groups in the secondary 
gene set (Fig. 1, B and E). Term names are derived from relevant databases (Category). Genes refers to the those found related to the term name, and % term 
refers to the percentage of total genes in the term represented. The uncorrected P values and FDR values are provided. 


Other 
UniProt Keywords 


Group 1 Group 2 Group 3 


Category 


GO Component GO Component UniProt Keywords 


Spindle 


CEP63 HAUS4 HAUSS MAD2L2 TERF1 MYH1O TP53 PTCHI STARDI3 EPB4IL3 
FAM161A HEPACAM RANGAPI RACGAPI NFI RBL1 EXTI EXT2 


0.0016 


0.0021 0.0029 


Table 3. Clinical, molecular, and familial features of probands carrying C4 or C5 variants in the shelterin complex and associated genes. AS, 
angiosarcoma; CS, chondrosarcoma; FMS, fibromyxosarcoma; GIST, gastrointestinal stromal tumor; LPSWD, Well- or dedifferentiated liposarcoma; LPSM, 
myxoid liposarcoma; MPNST, malignant peripheral nerve-sheath tumor; OS, osteosarcoma; PNET, primitive neuroectodermal tumor; RMS, rhabdomyosarcoma, 
not otherwise specified; SS, synovial sarcoma; UPS, undifferentiated pleomorphic sarcoma; LFS, Li Fraumeni Syndrome; NS, no syndrome; U, uninformative. 


Second malignancies 
(ages at diagnoses) 


Thyroid (58); melanoma (41, 44, 50) 


ID Gene Protein change Gender Sarcoma _ Age at diagnosis Pedigree pattern 


017.468.1 


p.GIn94Glu IF 


LPSWD 62 Familial melanoma 


041.1598.1 


Chompret 


p.Glu848Ter F NA NS 


UPS NA 


g 
p.Arg265Ter lF 


015.1183.1 TINF2 Classic LFS 


(47). CEP63 and CEP89 also interact with HAUS4 
and HAUS5, augmin-like complex members 
that regulate centrosome integrity (48). Nota- 
bly, this centrosome group adds to the known 
genes linked to MPNST and GIST. 

The methodologies applied here have broad 
applications for disease-focused pathway dis- 
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covery in the emerging era of WGS, allowing 
mapping of rare SNV/indels and structural 
variation (SV) in large populations. In our 
study, SNV/indels (rather than SVs) appear 
the dominant source of excess pathogenic bur- 
den. In contrast to population-based controls, 
controls of healthy elderly individuals increase 


statistical power by stripping out pathogenic 
variation-causing phenotypes (such as cancer) 
that emerge later in life. Notably, the combi- 
nation of population-scale and familial cohort 
designs, historically distinct from each other, 
allows correlation of genotypes to clinical and 
familial patterns. The risk-to-relatives analyses 
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demonstrated an increased cancer incidence 
in family members of probands with shelterin- 
complex variants comparable with that seen 
for carriers of variants in TP53. Statistical 
approaches that combine gene-level pathogenic 
enrichment with pathway-based approaches 
based on protein interactions (49) and ontol- 
ogies enhance the power for biological in- 
sights into cancer pathogenesis (50, 57). These 
approaches have identified genes and biologic 
pathways that appear specific to mesenchymal 
malignancies, as well as mapping the contribu- 
tions of known cancer genes. Amongst known 
and actionable genes, the lack of enrichment 
in BRCAI or BRCA2 is notable, although some 
sarcoma families meet both HBOC and LFS 
criteria. We add 14 candidates (including TERFI, 
TINF2, TERF2IP, SMARCAL1, TIMELESS, 
STAG3, CEP63, CEP72, HAUS4, and HAUS5) 
to more than 100 known sarcoma-associated 
genes (7). Biologically, these data suggest that 
the telomeric and mitotic pathways may play 
specific roles in sarcoma susceptibility, anal- 
ogous to homologous recombination and mis- 
match repair in susceptibility to breast and 
colorectal cancers. 
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Total synthesis of lissodendoric acid A via 
stereospecific trapping of a strained cyclic allene 


Francesca M. Ippolitit, Nathan J. Adamson{, Laura G. Wonilowicz, Daniel J. Nasrallah, Evan R. Darzit, 


Joyann S. Donaldson§, Neil K. Garg* 


Small rings that contain allenes are unconventional transient compounds that have been known since the 
1960s. Despite being discovered around the same time as benzyne and offering a number of 
synthetically advantageous features, strained cyclic allenes have seen relatively little use in chemical 
synthesis. We report a concise total synthesis of the manzamine alkaloid lissodendoric acid A, which hinges 
on the development of a regioselective, diastereoselective, and stereospecific trapping of a fleeting cyclic 
allene intermediate. This key step swiftly assembles the azadecalin framework of the natural product, 
allows for a succinct synthetic endgame, and enables a 12-step total synthesis (longest linear sequence; 
0.8% overall yield). These studies demonstrate that strained cyclic allenes are versatile building blocks in 


chemical synthesis. 


eometrically constrained organic mol- 

ecules have long fascinated chemists. 

One subclass of such compounds are 

small cyclic molecules that bear func- 

tional groups of typically linear geom- 
etry. Embedding typically linear functional 
groups within small ring systems forces bent 
geometries, leading to a high degree of strain. 
Small rings that contain triple bonds were 
speculated in the early 1900s (1), ultimately 
leading to the validation of benzyne (1) (2, 3) 
and cyclohexyne (2) (4) in the 1950s (Fig. 1A) 
(5-9). Although these in situ-generated inter- 
mediates exhibit substantial strain energies of 
~40 to 50 kcal/mol (0, 12) and short lifetimes 
(12, 13), scientists have learned how to leverage 
these highly reactive intermediates in a range 
of useful chemical transformations and bio- 
logical applications. Examples of aromatic and 
nonaromatic cyclic alkyne trapping experi- 
ments can be seen in the synthesis of im- 
portant ligands, such as XPhos, medicines, 
agrochemicals, materials, naturally occur- 
ring compounds, and other areas (9). More- 
over, bioorthogonal chemistry (14), recently 
recognized by the Nobel Prize in Chemistry, 
relies on cyclooctynes (8) and other strained 
intermediates. 

Cyclic allenes, such as 1,2-cyclohexadiene 
(4), are a related class of strained molecules 
that contain a typically linear functional group 
that is constrained in a small ring (Fig. 1A). 
Cyclic allene 4 was first validated in 1964 (75) 
and again in 1966 (6). This intermediate, along 
with other cyclic allenes, has received far less 
attention compared with that of strained 
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cyclic alkynes, despite having a number of 
advantages that warrant their further investi- 
gation. Our laboratory has been interested in 
heterocyclic allenes, such as azacyclic allene 5, 
because of their properties and potential use 
in chemical synthesis (9, 17, 18). The geometry- 
optimized structure of this species is depicted 
in Fig. 1B. As shown, the allene C=C bonds are 
each 1.32 A, and the central allene carbon 
bears an internal angle of 133°. Additionally, 
the allene C-H and C-CHz bonds are twisted 
out of the allene C=C=C plane by 39° and 36°, 
respectively. Several features result from this 
distinctive structure, including the high re- 
activity of cyclic allenes due to strain; their 
capacity to undergo cycloadditions, nucleo- 
philic additions, or metal-catalyzed reactions; 
and the capability to form two new bonds in a 
single transformation, with introduction of up 
to three C(sp”) stereocenters. As such, strained 
cyclic allenes have recently been used to pre- 
pare highly substituted, sp®-rich compounds 
(9, 17-24) and have even been used to access 
DNA-encoded libraries (25). Further under- 
scoring their attractiveness is that cyclic al- 
lenes are axially chiral and could serve as 
unconventional yet valuable building blocks 
for stereoselective synthesis. Moreover, from 
a practical perspective, strained cyclic allenes 
can possess heteroatoms within the ring and 
can be accessed under mild conditions (such 
as fluoride-based reactions or ambient tem- 
peratures) (19). Last, because of the relatively 
scarce body of literature surrounding strained 
cyclic allenes, there exist numerous opportu- 
nities for discovery with regard to reactivity 
and selectivity patterns in synthetic applica- 
tions. Cyclic allenes have rarely been used in 
the synthesis of complex natural products, 
which is notable given that their close rela- 
tives, arynes and strained cyclic alkynes, have 
been used in more than 100 total syntheses to 
date (6-8). One notable example of the use of a 
cyclic allene in total synthesis was reported in 


1997 by Marshall and Sehon and involves a 
relatively unstrained allene embedded in a 
macrocyclic 14-membered ring (26). 

With the aim of developing the chemistry of 
strained cyclic allenes and evaluating their 
utility in total synthesis, we considered lis- 
sodendoric acid A (6) (27), a structurally com- 
plex member of the manzamine family of 
alkaloids (28) (Fig. 1C). Manzamine alkaloids 
have been popular targets for total synthesis 
because of their complex structures and pro- 
mising biological activities (29, 30). The cu- 
mulative synthetic efforts toward manzamine 
alkaloids, as recently highlighted by Dixon and 
Jakubec (29), have involved more than 25 re- 
search groups to date. With regard to lisso- 
dendoric acid A (6), this natural product was 
isolated in 2017 from the marine sponge 
Lissodendoryx florida and has been shown to 
reduce reactive oxygen species (ROS) in a 
Parkinson’s disease model consisting of Neuro 
2a cells treated with 6-hydroxydopamine (~50% 
reduction in ROS levels at concentrations of 
0.1 and 10 uM) (27). The natural product’s 
complex structure poses a daunting synthetic 
challenge that in turn would push the limits of 
strained cyclic allene chemistry. The core of 
lissodendoric acid A (6) is an azadecalin scaf- 
fold bearing a conjugated diene, a carboxylic 
acid substituent, and two stereogenic centers, 
one of which is quaternary at C8a. In addition, 
the natural product bears a 14-membered mac- 
rocycle tethered beneath the C8a aminodecane 
substituent. A total synthesis of lissodendoric 
acid A (6) has not been reported. 

We questioned whether the azadecalin core 
and C8a quaternary stereocenter (for example, 
7) could be introduced simultaneously by 
means of a [4+2] cycloaddition between cyclic 
allene 8 and 2-pyrone 9 (Fig. 1C) (37). The 
strain associated with cyclic allene 8 would be 
crucial to the success of the proposed reaction 
because such intermolecular cycloadditions 
of analogous cyclic alkene substrates are chal- 
lenging (32, 33). The use of 2-pyrones in [4+2] 
cycloadditions with cyclic allenes was previ- 
ously unknown but, if successful, could enable 
control of regioselectivity through an inverse 
electron-demand Diels-Alder process. In addi- 
tion, the use of a pyrone would furnish a 
[2.2.2]-bicyclic product (7), which in turn 
would function as a masked 1,3-diene, enabling 
the strategic unveiling of this potentially sen- 
sitive motif at a later stage through thermal 
expulsion of CO,. Last, we were intrigued by 
the possibility of accessing cyclic allene 8 in 
enantioenriched form and developing enantio- 
specific trapping reactions. 

Our successful total synthesis of lissoden- 
doric acid A (6) was enabled by the develop- 
ment of a regioselective, diastereoselective, 
and stereospecific trapping reaction of a 
fleeting cyclic allene intermediate. The total 
synthesis is concise owing to generation of 
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Lissodendoric Acid A(6) _- [2.2.2}-bicycle as masked diene wanging stedien 4 


Fig. 1. Strained intermediates and overview of current study. (A) Seminal strained cyclic intermediates 

1, 2, and 4 and bioorthogonal cyclooctyne reagents 3. (B) Geometry-optimized structure of strained azacyclic allene 
5 [wB97XD/6-31G(d)] and key features. CO2Me is omitted from the three-dimensional representations for clarity. 
(C) Our synthetic target, lissodendoric acid A (6), and our synthetic approach by using a [4+2] cycloaddition to 
access 7 from cyclic allene 8 and pyrone 9 (R; R’) R'"” = alkyl). 
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Fig. 2. Regioselectivity and stereospecificity studies using variable cyclic allene precursors and [4+2] 
cycloaddition partners. (A) Use of cyclic allene precursor 12 allows for efficient and stereospecific cycloadditions. 
(B) Stereospecificity is modest when using pseudo-isomeric silyltriflate 16. (C) Switching to silylbromide 18 as 
the cyclic allene precursor allows for regioselective and stereospecific cycloaddition reactions, while providing 

a plausible entryway to enantioenriched material. Highlights indicate the primary differences between cyclic allene 
precursors 12, 16, and 18. Reaction conditions are provided in the supplementary materials. 
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substantial structural complexity in the key 
cyclic allene trapping step. 


Development of regio- and 
stereoselective cycloadditions 


Because the success of our approach would 
hinge on the ability to control the regioselec- 
tivity and stereoselectivity in the key [4+2] 
cycloaddition step, we performed preliminary 
studies with furan 11 and pyrone 13 (Fig. 2). 
These dienes were selected so that we could 
compare and contrast the electronic effects of 
the cyclic allene trapping partners (electron- 
rich furan versus electron-poor pyrones). Pyrone 
13 was chosen because it was easily accessible 
and possesses potentially useful synthetic hand- 
les, whereas furan (11) had previously been 
used in cyclic allene [4+2] cycloadditions (17). 
The syntheses of 12, 16, and 18, which func- 
tion as precursors to cyclic allene 8 (Fig. 1C), 
are detailed in the supplementary materials 
and relied on chiral separation technology to 
access enantioenriched material. Treatment 
of silyltriflate 12 with furan (11) and CsF in 
acetonitrile at 23°C delivered the undesired 
regioisomer 10, which is consistent with prior 
experiments that used the N-Cbz derivative of 
12 (77). In a crucial result, we found that treat- 
ment of 12 with readily available pyrone 13 
(34) gave cycloadduct 14, which possesses the 
desired connectivity and the necessary C8a 
quaternary center, without substantial loss of 
stereochemical integrity. The reaction is endo- 
selective, which is consistent with theoretical 
studies of 1,2-cyclohexadienes (35), and occurs 
with the expected regioselectivity based on 
this being an inverse electron-demand pro- 
cess. These results demonstrate that regiose- 
lectivity in cyclic allene [4+2] cycloadditions 
can be modulated by judicious selection of the 
trapping agent. 

Although these results were encouraging, 
we were unable to access compounds such as 
12 in high enantiomeric excess (ee) without 
the use of chiral separation technologies. As 
such, we evaluated alternative precursors 16 
and 18, in which the silicon substituent would 
be placed on a less-substituted carbon, in the 
corresponding cycloaddition reactions (36). As 
shown in Fig. 2B, the use of pseudo-isomeric 
silyltriflate 16 (albeit with Et in place of Me 
owing to substrate synthesis logistics), fur- 
nished the expected cycloadducts 15 and 17. 
However, regardless of the trapping agent, 
poor stereoretention was observed for reasons 
that are not well understood presently. Last, 
we examined silylbromide 18, which is anal- 
ogous to an approach to cyclic allene genera- 
tion developed by West and coworkers (Fig. 
2C) (22). Although yields were modest in these 
initial studies, notable stereoretention was 
observed in the formation of cycloadducts 10 
and 14 (37). Silylbromide precursors to cyclic 
allenes had not been synthesized previously in 
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Fig. 3. Assembly of the azadecalin core of lissodendoric acid A (6) by using a strained cyclic allene. (A) Synthesis of pyrone 22. (B) Enantioselective route 
to silyltriflate 27. (C) Cycloaddition of the strained cyclic allene derived from 27 with pyrone 22 proceeds with regioselectivity, diastereoselectivity, and 
stereospecificity to deliver enantioenriched adduct 28 with the desired quaternary stereocenter at C8a. 


enantioenriched fashion, but we were opti- 
mistic that derivatives of 18 could be prepared 
without the need for chiral separations. Before 
describing the translation of these studies to 
the total synthesis of lissodendoric acid A (6), 
we also highlight several key features of cyclic 
allene chemistry that are reflected in the re- 
sults shown in Fig. 2. Specifically, all reactions 
proceed under mild conditions, lead to the for- 
mation of two C-C bonds, occur with high endo 
diastereoselectivity (35), and provide access to 
complex heterocyclic products containing three 
stereocenters, one of which is quaternary in 
the cases of the pyrone cycloadducts. 


Fragment synthesis and assembly of 
azadecalin core 


Having established the feasibility of the key 
cyclic allene trapping step, we sought to per- 
form analogous studies on a more elaborate 
substrate. Pursuing this endeavor could allow 
us to push the limits of strained cyclic allene 
chemistry while enabling the synthesis of lis- 
sodendoric acid A (6). Pyrone 22 was prepared 
in two steps from commercially available car- 
boxylic acid 19 (Fig. 3A). Double tosylation, 
followed by the addition of tBuOH, furnished 
tosylate 20, bearing a t-butyl ester substituent. 
Subsequent Negishi coupling with organozinc 
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reagent 21 provided the alkylated pyrone 22, 
which contains a terminal alkene necessary 
for the eventual installation of the macrocycle 
by using ring-closing metathesis. The desired 
cyclic allene precursor, silylbromide 27, was 
prepared from bromotriflate 23, which was 
obtained commercially (Fig. 3B). Treatment 
of 23 with alkylzirconium reagent 24 gave 
ketone 25 through 1,4-addition and ejection of 
the triflate leaving group (38). Corey-Bakshi- 
Shibata (CBS) reduction (39), followed by treat- 
ment with ethylchloroformate, gave 26 in 90 
to 92% ee. Displacement of the carbonate with 
a silyl cuprate nucleophile (22, 40, 41) de- 
livered cyclic allene precursor 27 without loss 
in enantioenrichment. This step was thought 
to proceed with inversion of stereochemistry 
on the basis of prior studies by Oestreich (41). 
The conversion of bromoketone 25 to silyl- 
bromide 27 parallels a general racemic strat- 
egy pioneered by Lofstrand et al. (22), but the 
enantioselective reduction and stereospecific 
displacement of the carbonate had not been 
demonstrated previously. 

With pyrone 22 and cyclic allene precursor 
27 in hand, we evaluated the key [4+2] cyclo- 
addition (Fig. 3C). We found that by simply 
treating these reactants with CsF, the desired 
cycloadduct, 28, could be obtained. Through 


optimization, we found that the optical yield 
(42, 43), or percent stereoretention, was great- 
ly improved at lower reaction temperatures, 
which in turn required the use of Bu,NBr to 
improve the solubility of fluoride ion in solu- 
tion. Ultimately, -20°C was found to be the 
optimal temperature, enabling the formation 
of 28 in 73% isolated yield, presumably by 
means of the depicted transition structure 
proposed on the basis of prior computational 
studies of cyclic allene Diels-Alder reactions 
(17, 35, 44). 

Several aspects of this complexity-generating 
step should be noted, especially the regiose- 
lectivity and stereoselectivity. The regioselec- 
tivity is thought to depend on both the cyclic 
allene and pyrone components. First consid- 
ering the cyclic allene component, either olefin 
of the allene could potentially undergo trap- 
ping; however, the more electron-rich sub- 
stituted olefin of the cyclic allene engages with 
the electron-deficient pyrone, presumably be- 
cause of the inverse electron-demand nature 
of this cycloaddition. With regard to the pyrone 
component, the diene in pyrone 22 aligns with 
the cyclic allene to promote bond formation 
between C8a and C8. The pyrone carbonyl, 
which is conjugated to the diene in 22, is 
thought to provide a dominant electronic effect 
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Fig. 4. Completion of the total synthesis of lissodendoric acid A (6). 


that leads to this selectivity. The origins of 
stereoselectivity in this transformation are 
also notable. The reaction is thought to pro- 
ceed in an endo fashion, with favorable orbital 
overlap between the diene and the nonreac- 
tive olefin of the cyclic allene, and with favor- 
able approach of the pyrone opposite the C8a 
alkyl substituent. This gives rise to 28 in >20:1 
diastereomeric ratio. The endo approach and 
facial selectivity are consistent with prior com- 
putational studies of cyclic allene Diels-Alder 
reactions (17, 35). Despite that four cyclo- 
adducts could be envisioned (not accounting 
for enantiomers discussed below), cycloadduct 
28, which bears considerable structural com- 
plexity and functionality, is formed selectively 
as the sole observable product of the reaction. 

The absolute stereochemistry and optical 
yield also warrant particular attention. Cyclic 
allene precursor 27 bears a single stereocenter 
(Fig. 3B, asterisk). Upon generation of the 
cyclic allene intermediate, point chirality in 
27 is transmitted to axial chirality in the cyclic 
allene intermediate. Then, cycloaddition rein- 
troduces point chirality, with the generation of 
two tertiary stereocenters and the C8a quater- 
nary stereocenter. The sole stereocenter pres- 
ent in reactant 27 is ablated in this process. 
The optical yield (42, 43), or stereoretention, 
in forming 28 at -20°C is 88%, indicating that 
strain-driven cycloaddition between two high- 
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ly substituted reactants is favorable over facile 
enantiomerization of the strained cyclic allene 
at low temperature. The barrier for enantio- 
merization for related cyclic allenes has been 
estimated to be ~14 kcal/mol (45). 


Completion of total synthesis 


To complete the total synthesis of lissoden- 
doric acid A (6), we developed the concise late- 
stage sequence shown in Fig. 4 using (+)-28. 
The first challenge was to introduce the C4a 
hydrogen substituent chemoselectively, with- 
out reduction of the other olefins. The C4/C4a 
olefin proved recalcitrant toward a range of 
reduction conditions; however, we ultimately 
achieved a selective reduction by first oxidiz- 
ing C3 with pyridinium dichromate (PDC)/ 
t-BuOOH (46, 47) and then treating the result- 
ing o,f-unsaturated lactam intermediate with 
a copper hydride source (48) to effect di- 
astereoselective 1,4-reduction and furnish 
cis-azadecalin 29. The CO, bridge embed- 
ded in the [2.2.2]-bicycle of 28 (installed in the 
aforementioned cyclic allene trapping) was 
important for the success of this sequence 
because it masked the 1,3-diene motif that 
would otherwise be in conjugation with the 
C4/C4a olefin. Next, we arrived at tetraene 30 
through a sequence involving thermal extru- 
sion of CO, (49), copper triflate-promoted re- 
moval of two of the three Boc groups (50), and 


amidation of the piperidine nitrogen with 
acryloyl chloride. Access to 30 set the stage 
for macrocyclization, which we achieved using 
ring-closing metathesis promoted by the robust 
Grubbs’ 2nd Generation catalyst (Sigma-Aldrich) 
(51, 52). The desired macrocycle 31 was ob- 
tained in 83% yield and as an inconsequential 
mixture of E:Z olefin isomers (ratio ranging 
from 2:1 to 3.5:1). All that remained to com- 
plete the total synthesis was to selectively 
reduce the a,-unsaturated imide and perform 
a global deprotection. We achieved the former 
using a reduction protocol reported by Das 
et al. (53), which led to saturation of the C3 
amide and the C10 o,f-unsaturated amide 
selectively without reduction of the ester or 
diene functional groups. Although the overall 
yield of this transformation is modest (38%), 
given that three functional groups are being 
reduced (~’72% yield per reduction), we deemed 
the yield acceptable. Last, treatment of the 
reduced intermediate with trifluoroacetic acid 
furnished lissodendoric acid A (6). The total 
synthesis is 12 steps from 23 (longest linear 
sequence; 0.8% overall yield). 

Another matter of interest is the absolute 
stereochemical configuration of lissodendoric 
acid A. On the basis of experimental and com- 
putational circular dichroism studies, the iso- 
lation chemists had postulated the absolute 
configuration of (-)-6 to be as we have de- 
picted throughout this study (27). To validate 
this hypothesis, our enantioenriched material 
was elaborated to the natural product. The 
circular dichroism spectrum for our synthetic 
sample matched that of a natural sample. We 
conclude that the natural configuration of 
(-)-6 proposed by the isolation chemists was 
indeed correct. 

These studies demonstrate that cyclic allenes 
are powerful tools for complex molecule synthe- 
sis and should prompt the further investigation 
and application of these unusual compounds. 
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LIGHT POLLUTION 


Citizen scientists report global rapid reductions in 
the visibility of stars from 2011 to 2022 


Christopher C. M. Kyba,'2* Yigit Oner Altintas,'+ Constance E. Walker,> Mark Newhouse* 


The artificial glow of the night sky is a form of light pollution; its global change over time is not well 
known. Developments in lighting technology complicate any measurement because of changes in lighting 
practice and emission spectra. We investigated the change in global sky brightness from 2011 to 

2022 using 51,351 citizen scientist observations of naked-eye stellar visibility. The number of visible 
stars decreased by an amount that can be explained by an increase in sky brightness of 7 to 10% per 
year in the human visible band. This increase is faster than emissions changes indicated by satellite 
observations. We ascribe this difference to spectral changes in light emission and to the average angle 


of light emissions. 


ver much of Earth’s land surface, the 
night sky no longer fully transitions to 
starlight and moonlight after sunset (7). 
Instead, the sky also glows with an arti- 
ficial twilight caused by the scatter of 
anthropogenic light in the atmosphere (2). 
The radiance of skyglow grew exponentially 
for much of the 20th century (3) as a result of 
population growth, expansion of settlements, 
and deployment of new lighting technologies 
(4, 5). The character of the night sky is now 
different from what it was when life evolved 
and civilization developed. 
Many of the behavioral and physiological 
processes of life on Earth are connected to 
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daily and seasonal cycles. For example, visual 
predation requires sufficient light to see, and 
predator-prey interactions are therefore ex- 
pected to be affected by skyglow (6). There 
are few controlled field studies of the eco- 
logical impacts of skyglow, but it has been 
have shown to affect plants, animals, and 
their interactions (7), and laboratory studies 
have demonstrated changes in the physiol- 
ogy of fish at skyglow-like nighttime il- 
luminance of 0.01 lux (8). In addition to its 
environmental consequences, skyglow lim- 
its human observation of starry skies and 
the Milky Way. The increase in skyglow has 
affected human culture (9), not only by re- 
stricting stargazing and astronomy but also 
by changing the overall appearance of the 
night sky. 

Effective methods for reducing light pollu- 
tion are well understood (10, 17), and many 
of them also reduce electricity consumption. 
These measures have been implemented on 
local scales (72, 13) but have not seen wide- 
spread adoption. Nevertheless, awareness of 
light pollution has led some policy-makers to 
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introduce measures that attempt to control 
light pollution (14). 

During the 2010s, many outdoor lights were 
replaced by light-emitting diodes (LEDs): 
Global LED market share for new general 
lighting grew from under 1% in 2011 to 47% in 
2019, and LED market share for new outdoor 
lighting in the United States was 66% in 2020 
(15). The impact on skyglow from this transi- 
tion to LEDs is unclear. Some researchers have 
predicted that it will be beneficial (76); others, 
that it could be harmful because of spectral 
changes (17) or a rebound effect (78), in which 
the high luminous efficacy (more light emitted 
for a given power) of LEDs leads to more or 
brighter lights being installed or longer hours 
of operation. 

The generation of skyglow and changes in 
its character are related to social, economic, 
and technological processes, and we therefore 
expect skyglow trends to differ within and 
across countries. Time-series measurements 
of skyglow from individual sites, although use- 
ful for some purposes, might not be represen- 
tative of how skyglow is changing on larger 
scales. It would therefore be beneficial to mea- 
sure changes in skyglow on continental and 
global scales. 

In principle, it is possible to directly mea- 
sure skyglow through satellite observations 
of Earth at night (79). Unfortunately, the only 
satellite instruments that currently monitor 
the whole Earth have limited resolution and 
sensitivity and cannot detect light with wave- 
lengths below 500 nm (20). This is a problem 
for three reasons: (i) Shorter wavelengths 
scatter more effectively in the atmosphere, 
increasing the chance that a photon emitted 
upward returns to Earth as skyglow (2); (ii) 
LEDs marketed as white usually have an emis- 
sion peak between 400 and 500 nm, where the 
satellite sensor is insensitive; and (iii) human 
visual sensitivity shifts toward shorter wave- 
lengths at night (77). The first two effects could 
mean that changes in ground radiance ob- 
served by satellite (2/7, 22) differ from changes 
in skyglow. The third effect means that ground- 
based radiometers (and photometers) face a 
similar problem: If skyglow darkens at longer 
wavelengths but brightens at shorter wave- 
lengths, it would be unclear whether the num- 
ber of stars visible to humans would increase or 
decrease (23, 24). 

We analyzed a citizen science dataset in 
which the human visual system was directly 
used as a sensor (24, 25). In the Globe at Night 
project [operated by the National Optical- 
Infrared Astronomy Research Laboratory 
(NOIRLab)], participants are presented with 
a set of star maps (example shown in fig. S1) 
and asked which one best matches the night 
sky at their location (26). This provides an 
estimate of the naked eye limiting magnitude 
(NELM), the visual apparent magnitude of 
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the faintest star that can be seen. Astronom- 
ical magnitudes are an inverted logarithmic 
scale, so the NELM is smaller for brighter 
skies. The NELM is related to skyglow because 
as background radiance increases, faint point 
sources of light become invisible (27). The lim- 
iting magnitude estimated by citizen scientists 
using this method correlates with the loca- 
tions of skyglow determined using satellite 
datasets (24, 25). We grouped different regions 
of the globe according to their sky brightness 
in 2014, as determined by the (satellite-based) 
World Atlas (7), to examine how the NELMs 
in similarly bright areas change over time 
(Fig. 1 and fig. $2). 

Our method (26) accounts for differences in 
the set of citizen scientists participating each 
year (Fig. 2) and allows us to measure changes 
in stellar visibility on global or continental 
spatial scales. The overall number of obser- 
vations limits the spatial and temporal scales 
over which trends can be determined (partic- 
ularly for developing countries, where rapid 
skyglow change is suspected but little obser- 
vational evidence is available). The Globe at 
Night data have a spatial bias toward Europe, 
North America (especially the United States), 
and a small number of other countries; there 
is also a bias toward inhabited areas. For ex- 
ample, 50.6% of the Asian contributions are 
from Japan, and contributions from Australia 
are overwhelmingly from coastal areas (Fig. 2B 
and fig. S3) during 2020 (Fig. 2A). Because 
Europe and North America have sufficient 
data in both time and space, we report trends 
for those continents and combine all the others 
(referred to as Rest of World hereafter). 

Although this dataset does not represent an 
average of either the land area or human pop- 
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ulation distribution, participants are concen- 
trated in regions where skyglow is most prev- 
alent (J). The global trend in skyglow that we 
measure likely underestimates the trend in 
countries with the most rapid increases in 
economic development, because the rate of 
change in light emission is highest there (27) 
and we expect the addition of new lights to 
have a greater impact on skyglow than would 
result from the replacement of existing lights. 

To convert the NELM measurements to 
rates of change in effective skyglow radiance 
for human vision, we fitted a model to a subset 
of the dataset [observations from January 2011 
to August 2022, without twilight, moonlight, 
or reported snow on the ground (26)], using a 
maximum likelihood method. There are five 
free parameters in the model: (i) the average 
limiting magnitude reported in regions with 
no light pollution (y intercept in Fig. 1); (i) 
the slope of the relationship between NELM 
and World Atlas skyglow radiance (similar to 
the slope in Fig. 1, but with a time-adjusted 
radiance); (iii) the annual rate of change in 
artificial skyglow; (iv) the standard deviation 
in the residuals between measured and pre- 
dicted NELM; and (v) an estimate of the error 
rate in the Globe at Night dataset (e.g., due 
to participants reporting their location or 
NELM incorrectly). 

We find that the change in the number of 
visible stars reported by Globe at Night par- 
ticipants is equivalent to a 9.6% per year an- 
nual increase in sky brightness, averaged over 
the locations of participants (Table 1). For an 
18-year period (such as the duration of a hu- 
man childhood), this rate of change would 
increase sky brightness by more than a factor 
of 4. A location with 250 visible stars would see 


All Years 
BM 2011 
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Fig. 1. Naked eye limiting magnitude estimated by Globe at Night participants as a function of the 
night sky brightness in 2014. The sky brightness factor (SBF) is the ratio of total radiance to natural 

sky radiance, so SBF = 1 indicates starlight and SBF = 10 (plotted as logjgSBF = 1) indicates that the sky is 

10 times as bright as starlight (plotted as log;gSBF = O and 1, respectively) (26). The relationship is shown for 2011 
(green squares), 2021 (blue triangles), and the average of all years from 2011 to 2022 (gray circles). Smaller 
NELM values mean that fewer stars are visible. Lines indicate linear models fitted to the data for logigSBF >0.5, 
which corresponds to ~3 times as bright as starlight. Shaded regions show the 95% confidence interval. 
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a 
Table 1. Summary of results for different geographic areas. Best-fitting values of our model are listed for three continental groupings and the global 
average. f is the rate of increase, N, is the NELM for natural skies, § relates NELM to the World Atlas, ois the standard deviation of residuals, and E is the error 
rate (26). Uncertainties are +1 standard deviation and are statistical only. Rest of World includes four continents because of insufficient coverage (see text). 
More detailed results, before combining continents, are listed in table S1. 


Region fF (%) 


Europe 


Global average 


N, (NELM) $s 
4.77 + 0.03 -1.33 + 0.03 


c (NELM) 
0.953 + 0.012 


E (%) 


Rest of World EOF 466 + 0.03 Neto AY Bc Ren RE 7137 + 0.02 I Me ete IEE LE Sr ee ROR RE OR vies 
4.825 + 0.014 -1.429 + 0.014 1.022 + 0.006 21402 
A 24000 


that number reduce to 100 visible stars over 
the same period. Because our method uses 
measurements made with human vision, it 
accounts for changes in both the radiance 
and spectrum of the night sky. 

We confirmed this finding by performing 
an alternative analysis of the data. Instead of 
using a maximum likelihood method to fit a 
model of the skyglow change, we performed 
least-squares fitting of a linear model to the 
monthly (observed minus expected) NELM 
residuals under the assumption that skyglow 
remains constant (Fig. 3A) or increases at a 
rate of 9.6% per year (Fig. 3B). The best- 
fitting rate of change in NELM residuals is 
-0.044 + 0.007 magnitude per year for the 
uncorrected model and 0.001 + 0.007 magni- 
tude per year for the model corrected for 
skyglow increase. The even distribution of 
points above and below zero in the corrected 
model shows that the trend is not being 
driven by outliers at the start or end of the 
analysis period (Fig. 3B). 

The rate of skyglow increase that we find is 
much larger than the rates of growth in light 
emissions observed by satellite in the 500- to 
900-nm band, which were 2.2% per year glo- 
bally during 2012-2016 (27) and >1.6% per 
year during 1992-2017 (with the possibility 
of faster increases in the visual band from 
2012 onward) (22). For a more direct compar- 
ison, we analyzed the satellite radiance trends 
at the Globe at Night observation locations 
during 2014-2021 (26). Even after controlling 
for the locations, rates of change in upward 
radiance measured by satellite were also much 
smaller than our calculated rate of skyglow 
increase (table S2). For example, for North 
America during 2011-2021, we find the rate 
of increase in skyglow radiance is 10.4 + 
0.5%, compared to —0.80 + 0.04% in the sur- 
face radiance measured by satellite at these 
locations during 2014-2021 (table S2). We 
ascribe the smaller satellite radiance change 
at Globe at Night locations compared to that 
found in other studies (21, 22) to the ten- 
dency of Globe at Night participants to observe 
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Fig. 2. Participation in Globe at Night from January 2011 to August 2022. (A) Bar chart showing the 
temporal distribution. Participation for each year is subdivided by continent, as indicated in the legend. 
(B) Spatial distribution of all years combined, on an equal-area Eckert IV map projection. Colors indicate 


the participation normalized by land area, on a logarithmic scale. Black points show the locations of 
individual observations (more visible in fig. S3). Some of the largest countries have been divided into 


smaller jurisdictions. 


from residential areas (see materials and 
methods). 

These different results are not incompatible, 
because there are several differences between 
observing surface radiance with satellites and 
sky radiance as seen by humans on the ground. 
For example, the widespread conversion of 


streetlights from gas discharge lamps to LEDs 
(16, 28) could result in spectral changes that 
affect the two datasets differently, as discussed 
above. If luminance is maintained after con- 
verting street lighting to LEDs, the spectral 
shift toward shorter-wavelength (bluer) light 
causes the radiance observed by the satellite to 
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Fig. 3. Difference from expected limiting magnitude under different skyglow growth models. The monthly residuals (observed minus expected) between 

reported NELM and the expectation based on the World Atlas (Fig. 1) under two models: (A) the standard 2014 World Atlas; (B) the World Atlas radiance multiplied by 
an exponential increase of 9.6% per year relative to January 2014 (26). Positive residuals indicate that observers reported more stars than expected. Error bars show 
the standard error, and data points before 2014 include higher numbers of observations than those made after 2014. Black lines show a linear model fitted to the 
residuals, and the shaded region shows the 95% confidence interval. 


decrease (17). By contrast, skyglow mesopic 
luminance after installation of LEDs could 
potentially either increase as a result of in- 
creased atmospheric scattering of blue light 
(2, 29) or decrease as a result of improved 
lighting fixtures that reduce horizontal emis- 
sion (16). 

The contributions to ground-observed sky- 
glow and satellite-observed surface radiance 
depend on the lighting type. Most satellite 
radiometers have little to no sensitivity to 
light emitted toward the horizon (e.g., from a 
window or self-luminous sign) (30). However, 
light propagating toward the horizon is the 
largest contributor to skyglow because of its 
longer path length (by an order of magnitude) 
from ground to space at such angles. In the 
early evening, a large fraction of the light that 
escapes cities is emitted by sources other than 
streetlights (37). Some of these lighting appli- 
cations, such as decorative and advertising 
lighting, produce a larger fraction of horizon- 
tally propagating light than modern street 
lighting does. It is therefore likely that some of 
the differences between the rates of change for 
skyglow that we calculate and those estimated 
from satellite data arise from changes in light- 
ing practices or deployment. 

We draw two conclusions from these re- 
sults. First, the visibility of stars is deterio- 
rating rapidly, despite (or perhaps because of) 
the introduction of LEDs in outdoor lighting 
applications. Existing lighting policies are 
not preventing increases in skyglow, at least on 
continental and global scales. Second, the use 
of naked-eye observations by citizen scientists 
provides complementary information to the 
satellite datasets. 
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BEHAVIORAL ADAPTATION 

Phenotype-environment mismatch errors enhance 
lifetime fitness in wild red squirrels 

Lauren Petrullo'*, Stan Boutin’, Jeffrey E. Lane®, Andrew G. McAdam‘, Ben Dantzer">* 


Mismatches between an organism's phenotype and its environment can result in short-term fitness 
costs. Here, we show that some phenotype-environment mismatch errors can be explained by 
asymmetrical costs of different types of errors in wild red squirrels. Mothers that mistakenly increased 
reproductive effort when signals of an upcoming food pulse were absent were more likely to correctly 
increase effort when a food pulse did occur. However, mothers that failed to increase effort when cues of 
an upcoming food pulse were present suffered lifetime fitness costs that could only be offset through 


food supplementation. In fluctuating environments, such phenotype-environment mismatches may 
therefore reflect a bias to overestimate environmental cues and avoid making the costliest error, 


ultimately enhancing lifetime fitness. 


n organisms ability to match its pheno- 
type to environmental conditions is cen- 
tral to hypotheses related to the evolution 
of adaptive phenotypic plasticity (d). 
Organisms producing phenotypes that 
are adaptive in a particular environment 
(ie., a “match”) can enhance their reproductive 
success and survival. Alternatively, organisms 
that mismatch their phenotypes to environ- 


A 


Litter size plasticity in red squirrels 


mental conditions often suffer fitness costs. 
For example, birds that mistime breeding to a 
fluctuating food supply experience poorer off- 
spring growth (2) and recruitment (3). Such 
mismatches have historically been viewed as a 
cost of phenotypic plasticity (4). Yet, emerging 
research demonstrates that mismatched pheno- 
types may also promote long-term population 
persistence (5, 6). Organisms may therefore 
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Fig. 1. Litter size plasticity generates phenotype—environment mismatches. 
(A) Yukon red squirrels inhabit a resource pulse system where fluctuations in new 
food production (no pulse = nonmast year, pulse = mast year) combine with 
variation in litter sizes (large = three or more pups, small = less than three pups) to 
create matched (true positive, true negative) and mismatched (false positive, false 
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aim to manage the costs and potential bene- 
fits of phenotype-environment mismatches 
over their life course to maximize fitness. 
Existing theory about how organisms bal- 
ance the costs of making different types of 
errors can help explain the occurrence of mis- 
matches (7-9). For example, in the presence or 
absence of information about the future envi- 
ronment, an organism may respond or not re- 
spond, creating a minimum of four conditions 
(two matches, two mismatch errors). Each type 
of mismatch error is expected to have a distinct 
fitness cost, and organisms should err toward 
making the least costly of the two (7). For a 
mismatched phenotype to be adaptive, the costs 
of not responding to the cue (false negative) 
should be greater than the cost of responding 
erroneously in the absence of the cue (false 
positive) (7). Empirical tests of this framework 
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Factorial framework 
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negative) phenotypic responses. Some females that produced small litters in 
mast years went on to produce a second litter later that year (false negativet), 
potentially mitigating the costs of an initial reproductive mismatch. (B) A factorial 
framework showing frequency rate of each response category (percentage of 
litters falling into each category in a given year) in parentheses (N = 2729 litters). 
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are rare because they require a system where 
individuals experience within-generation en- 
vironmental fluctuations [fine-grained envi- 
ronments (J0)] and where lifetime fitness can 
be quantified to determine the net effects of 
matches and mismatches. Individuals must 
also receive reliable cues about the future en- 
vironment (17), and phenotypic responses must 
vary enough to permit a factorial assessment 
of relative fitness costs across all conditions. 


Assessing the costs and benefits of 
mismatching in a resource pulse system 


Here, we used a long-term study of wild North 
American red squirrels (Tamiasciurus hudsonicus) 
living in the Yukon, Canada, to probe whether 
phenotype-environment mismatches can be 
adaptive. Red squirrels in this region have been 
monitored continuously since 1987 (12). Using 
31 years of life-history and fitness data from 
over 1000 female squirrels, we tested whether 
reproductive phenotype-environment mismatch 
errors allow females to avoid making the costliest 
type of error and thus enhance lifetime fitness. 
Yukon red squirrels inhabit a resource pulse 
ecosystem where they experience profound 
fluctuations in food availability as a result of 
a swamp-and-starve strategy by their primary 
food source, seeds from the cones of masting 
white spruce (Picea glauca) trees (13). Squirrels 


A 


store spruce cones containing seeds under- 
ground in middens and rely on these resources 
in lieu of bodily fat reserves (14). In nonmast 
years, few to no new seeds are produced, in- 
traspecific competition for territories is high, 
and offspring overwinter survival is at its low- 
est (15). In these nonmast years, litter sizes 
experience stabilizing selection around a popu- 
lation mean of 3.0 pups (/6) (fig. S1), where the 
fecundity benefit of producing more pups is 
balanced by the benefit of producing small 
litters of fast-growing offspring that can ac- 
quire territories and survive winter (6, 17). By 
contrast, in mast years [approximately every 4 
to 7 years (13)], a superabundance of cones is 
produced in the autumn, leading to higher off- 
spring survival, and directional selection favor- 
ing increased litter sizes [optimum litter size = 
5.2 pups (16); population mean = 3.5 pups (fig. 
S1)]. Females use ecological cues to adaptively 
increase litter sizes in the months before the 
mast occurs, capitalizing on the future food 
boom by recruiting more pups into the breed- 
ing population (tables S1 and S2) (6). These 
increases in litter size may be caused by phys- 
iological responses to phytohormonal cues of 
an upcoming mast (J8) but not by energetic 
differences (19) or differences in the num- 
ber of hoarded spruce cones in an individual’s 
midden (table S3). 


Short-term fitness 
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Fig. 2. The fitness costs of mismatch errors are asymmetrical, cumulative, 
and ameliorated by food supplementation. (A) In nonmast years, the difference in 
annual reproductive success (number of pups recruited into the breeding population 
that year) between control females that produced large (false positive) and small 
(true negative) litters was negligible. In mast years, control females that failed 

to respond to mast cues and produced small litters (false negative) exhibited 
significantly lower annual reproductive success than those that produced large litters 
[true positive (tables S4 and S5), N = 2729 litters]. Violin plots show raw data; inset 
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Most females responded to mast cues by 
increasing litter sizes before the food pulse 
(true positive, 53.4%), but others mismatched 
by failing to increase litter size (false negative, 
23.1%) or by increasing litter size in the absence 
of mast cues [false positive, 26.7% (Fig. 1)]. 


Divergent costs of phenotype-environment 
mismatching 


When an upcoming mast was imminent, 
females that failed to respond (false negative) 
suffered a 97% decrease in their annual repro- 
ductive success (recruiting only 1.07 + 0.09 pups) 
compared to females that responded by in- 
creasing litter sizes [true positive, recruiting 
2.11 + 0.10 pups (Fig. 2A and tables S4 and 
S5)]. In nonmast years, females producing 
large litters (false positive) had annual repro- 
ductive success similar to that of females pro- 
ducing small litters [true negative (Fig. 2A and 
tables S4 and S5)], suggesting a marginal or 
otherwise undetectable cost of false positive 
errors (20). A small number of females (6.6%) 
produced a second successful litter in a mast 
year. When litter sizes were small but still a 
part of this multilitter strategy (false negative+), 
mothers minimized, but did not eliminate, the 
costs of a false negative error by having an 
additional litter in that year (Fig. 2A and tables 
S4 and S5). 


Lifetime fitness 
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Rate ratio (lifetime reproductive success) 
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boxplots depict median and outliers. (B) There was no cumulative cost to maternal 
lifetime fitness (number of total pups recruited into the breeding populaton over the 
life course) of repeated overestimation of mast cues (false positive), but each missed 
mast cue (false negative) significantly decreased lifetime fitness among control but not 
food-supplemented females (N = 1055 females). Filled circles show point estimates of 
rate ratios (relative difference). Bars denote 95% confidence intervals (Cls). Estimates 
>1.0 indicate a positive effect on lifetime reproductive success; estimates <L.0 indicate a 
negative effect. Asterisks denote significance (*P > 0.05, **P > 0.01, ***P < 0.001). 
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Because squirrels breed annually and fit- 
ness accrues with each breeding attempt, we 
assessed whether mismatched responses had 
cumulative effects on female lifetime fitness. 
Although each true positive (responding in 
presence of cue) produced across the life span 
increased maternal lifetime reproductive suc- 
cess by 12.5%, each false negative (failure to 
respond in presence of cue) mismatch de- 
creased lifetime reproductive success by 30% 
(Fig. 2B and table S6). Females would there- 
fore have to produce three true positive matches 
to recuperate the lifetime fitness costs of miss- 
ing a single mast cue, an impossibility given 
the frequency of mast events [once every ~4 to 
6 years (fig. S5)] and the maximum red squir- 
rel life span [8 years, median = 3.5 years (27)]. 


Avoiding the costliest error benefits mothers 
but not offspring 


The rare nature of spruce mast events provides 
additional ecological context for the mainte- 
nance of false positive mismatches. Females that 
did not experience a mast event before dying 
(54.2%) exhibited considerably lower lifetime 
reproductive success than females that did (table 
$2). The uncertainty of encountering a mast during 
asquirrel’s lifetime, coupled with the importance 
of mast years for maximizing lifetime fitness, 
should favor individuals that are highly sensi- 
tive to mast cues even if they mismatch in years 
when cues are absent (8, 22). Indeed, each false 
positive error produced over the lifetime in- 
creased a female’s probability of avoiding the 
costliest error (false negative) by 67% (Fig. 3A and 
table S7). Females therefore benefit from an 
error-avoidance bias (7) or adaptive gullibility 
(8), such that overestimating cues and respond- 
ing when cues are absent reduces the likelihood 
of failing to respond when cues are present. 
Using data on 8455 squirrel pups born to 
these females, we found that this error- 
avoidance bias, despite enhancing maternal 
lifetime fitness, came at a cost to offspring. 
Although females that produced large litters 
in nonmast years (false positive) reduced the 
likelihood that they made a mismatch error in 
a mast year (Fig. 3A), their pups grew slower 
[B = -0.15 + 0.07, P = 0.03 (Fig. 3B and tables 
S8 and S9)] and were less likely to survive 
their first winter, thus failing to contribute 
to the breeding population [f = -1.26 + 0.44, 
P = 0.005 (Fig. 3C and tables S10 and S11)]. 
Explaining why traits that increase maternal 
fitness but decrease offspring fitness (i.e., 
“selfish maternal effects”) exist has been con- 
tentious (23). Here, we show that traits pro- 
moting maternal fitness at the expense of 
offspring fitness, such as overestimating envi- 
ronmental cues, may be more likely to evolve 
in systems where mothers must manage asym- 
metrical costs of reproductive mismatch errors. 
It is currently unclear why heterogeneity in 
female reproductive responses to cues of spruce 
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Fig. 3. Overestimating mast cues enhances maternal lifetime fitness despite costs to offspring. 


(A) Each false positive error (erroneously producing a 


large litter in a nonmast year) made across a control 


female's lifetime significantly decreased her probability of making the costliest error [false negative (Fig. 2B), 
N = 361 squirrels]. Plot shows predicted probabilities from a binomial model (table S7); ribbon indicates 


95% Cls. (B) In nonmast years, when new food is low 


and territory competition is high, pups born into large 


litters (false positive) had poorer postnatal growth [true negative (tables S8 and S9), N = 5048 pups] 
and (C) were less likely to survive their first winter than pups born into small litters [true negative (tables S10 


and S11), N = 8455 pups]. Pups of supplemented fem 


ales that made false positive errors did not suffer a 


survival cost. Asterisks denote significance (*P > 0.05, **P > 0.01, ***P < 0.001). 


masting persists in this population, but it may 
reflect variation in the physiological integra- 
tion of mast cues. Being born in a mast year 
did not affect a female’s ability to respond to 
mast cues in the future (table $12). However, 
first-time (primiparous) mothers were ~1.5 times 
less likely to make a litter-size mismatch error 
than were multiparous mothers [B = —0.42 + 
0.17, P = 0.01 (fig. S3 and table $12)]. This may 
reflect parity-dependent differences in neuro- 
endocrine sensitivity to phytohormonal (es- 
trogenic) mast cues (18); it may also indicate 
that, because most females breed only once 
in their lifetime (fig. S2A), first-time mothers 
have more to lose by mismatching what may 
be their only attempt at contributing to the 


gene pool. 


Food supplementation modifies patterns of 
error avoidance 

In addition, environmental fluctuations that 
are episodic but predictable can create well- 
adapted systems in which the manipulation of 
resources is necessary to uncover mismatch 
costs. We tested the hypothesis that providing 
squirrels with additional food would disrupt 
selection for an error-avoidance bias by reduc- 
ing the fitness costs of mismatching. To do 
so, we provided ad libitum peanut butter to 
females living on a nearby but distinct experi- 
mental study area for 14 consecutive years. In 
mast years, food supplementation relaxed se- 
lection for larger litters, and directional selec- 
tion on litter size was negative rather than 
positive [B = —0.26 + 0.12, P = 0.03 (table S13)]. 
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Providing an additional food source dimin- 
ished and in some cases reversed the costs of 
mismatch errors. Producing a large litter in the 
absence of a cue (false positive) was associated 
with an increase in maternal annual repro- 
ductive success (Fig. 2A and tables S4 and 
S5) and cumulatively positive effects on life- 
time reproductive success (Fig. 2B and table 
S6). Supplemented females who failed to re- 
spond to mast cues (false negative) incurred 
short-term fitness costs similar to those incur- 
red by control females, but without any cumu- 
lative cost over the life span (Fig. 2B and table 
S6). An amelioration of the trade-off between 
offspring quality and quantity appears to be 
the mechanism that mitigated these costs. In 
nonmast years, pups born in large litters (false 
positive) on the food-supplemented study area 
did not exhibit poorer survival as compared to 
pups born into small litters [true negative (Fig. 
3C and tables S10 and S11)]. 

In environments where selective pressures 
are unpredictable, producing a mix of matched 
and mismatched phenotypes (i.e., bet-hedging) 
can maximize fitness (24). By contrast, in envi- 
ronments where selective pressures are predict- 
able, adaptive plasticity is expected to evolve in a 
manner dependent on the spatiotemporal grain 
of the environment (25). Organisms inhabiting 
predictable coarse-grained environments (across- 
generation fluctuations) should exhibit polymor- 
phisms that are environmentally induced (26) 
or genetically determined (27). Organisms in- 
habiting predictable fine-grained environments 
(within-generation fluctuations), such as Yukon 
red squirrels, are instead expected to exhibit 
adaptive phenotypic plasticity (10). 

What has remained unclear is why pheno- 
types that are mismatched to environmental 
conditions persist. Mismatch errors can occur 
because the cues that organisms receive about 
their future environment are imprecise or un- 
reliable (17), a growing phenomenon in the 
wake of global anthropogenic climate change 
(28). Or as our results indicate, mismatches 
can persist because organisms must balance 
asymmetrical costs of different types of errors 
to avoid making the costliest error over their 
lifetimes. 
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Nanobody-driven signaling reveals the core receptor 
complex in root nodule symbiosis 


Henriette Riibsam’+, Christina Krénauer'}, Nikolaj B. Abel’, Hongtao Ji*@, Damiano Lironi’, 
Simon B. Hansen’, Marcin Nadzieja’, Marie V. Kolte’, Dérte Abel’, Noor de Jong’, Lene H. Madsen’, 
Huijun Liu’, Jens Stougaard’, Simona Radutoiu’, Kasper R. Andersen'* 


Understanding the composition and activation of multicomponent receptor complexes is a challenge 

in biology. To address this, we developed a synthetic approach based on nanobodies to drive assembly 
and activation of cell surface receptors and apply the concept by manipulating receptors that 

govern plant symbiosis with nitrogen-fixing bacteria. We show that the Lotus japonicus Nod factor 
receptors NFR1 and NFR5 constitute the core receptor complex initiating the cortical root nodule 
organogenesis program as well as the epidermal program controlling infection. We find that 
organogenesis signaling is mediated by the intracellular kinase domains whereas infection requires 
functional ectodomains. Finally, we identify evolutionarily distant barley receptors that activate root 
nodule organogenesis, which could enable engineering of biological nitrogen-fixation into cereals. 


iving cells must constantly monitor their 
environment to integrate changes and to 
communicate with neighboring cells and 
microorganisms. Cell surface receptors 
perceive external signals and convert 
stimuli into intracellular responses. There is, 
however, a limited understanding and a lack 
of methodology for exploring how individual 
receptors assemble and function in signaling- 
competent complexes that coordinate the 
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downstream responses. We tackle this prob- 
lem in vivo using single-pass transmembrane 
cell surface receptors that govern plant-microbe 
interactions. Plants interact with a wealth of 
microbes and need to distinguish between 
those that pose a risk and those that offer a 
potential benefit. To this end, plants use pattern- 
recognition receptors, including lysine motif 
(LysM) receptors, that perceive microbial- 
derived carbohydrate signals and mount an 
intracellular response (J-4). Plant LysM re- 
ceptors recognize conserved cell wall compo- 
nents, such as chitin from pathogenic fungi 
(5, 6), Myc factors in arbuscular mycorrhizal 
symbiosis (7, 8), and Nod factors in nitrogen- 
fixing symbiosis with rhizobia (2, 9-12). Genetic 
studies in the model legume Lotus japonicus 
(Lotus) have identified two LysM Nod factor 
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receptors, NFR1 and NFR5 (0), along with 
the symbiosis receptor-like kinase SYMRK (13), 
which initiate and control the two developmen- 
tal programs leading to nodule organogenesis 
and infection (intracellular accommodation), 
respectively. Loss-of-function mutations in these 
three receptors render plants incapable of es- 
tablishing root nodule symbiosis. To study 
how the receptors collaborate during signaling 
and to define the active receptor complexes, 
we developed a synthetic approach based on 
single-domain antibody fragments from cam- 
elids (called nanobodies) to actively promote 
receptor assembly. We used the nanobody in- 
tervention to identify the core receptor com- 
plex that induces signaling in root nodule 
symbiosis in legumes. 


Nanobody-driven assembly defines the 
receptor complex initiating organogenesis 


To investigate receptor complex activation, we 
developed technology that enables the con- 
trolled assembly of receptors in living cells and 
used symbiotic receptors as proof-of-principle 
targets. To drive assembly, we used llama- 
derived nanobodies as they are small, specific, 
high-affinity binders that are active in the re- 
ducing environment of the cytosol (/4). We 
generated a nanobody with high affinity to the 
Lotus NFR5 kinase (Nbygrs) and used it as a 
C-terminal tag on the Lotus NFR1 receptor 
(Fig. 1A and fig. S1). We reasoned that Nbygrrs 
fused to the C-terminal tail of NFR1 would 
specifically recognize and bind the kinase do- 
main of the native NFR5 receptor and drive 
the formation of a heteromeric complex. This 
was tested by expressing nanobody-tagged 
NFRI receptor constructs from the native Nfri 
promoter and terminator in Agrobacterium 
rhizogenes—induced roots of nfrJ mutant 
plants (Fig. 1B). We observe nodule formation 
independent of rhizobia or Nod factor, illus- 
trating that receptor assembly is the key step 
in symbiotic signaling (Fig. 1C). In a control ex- 
periment, we omit the nanobody and observe 
that no spontaneous nodules are formed (Fig. 
1C). To establish a general approach that does 
not rely on nanobody generation for each 
protein of interest, we next used the well- 
characterized LaG16 nanobody that binds 
superfolder green fluorescent protein (GFP) 
(15). We fused GFP to NFR5 and the LaG16 
nanobody to NFR1 and expressed the constructs 
from their native promoters and terminators 
in nfr1 nfr5 double mutant plants (Fig. 1E). 
When NFR1 is associated to NFR5 through 
nanobody binding to GFP, nodules are formed 
independently of rhizobia (Fig. 1, D and F). 
Similarly, organogenesis is activated in the 
reverse situation with GFP fused to NFR1 and 
the nanobody fused to NFR5 (fig. $2). To test 
whether this phenotype is dependent on the 
direct binding between the nanobody and 
GFP, we engineered a mutant version of LaG16 


SCIENCE science.org 


A NFR1 NFRS5 B [pxr> a 
> — signaling? Cc no rhizobia 
WT nfr1 
b 
NFR5 ° 
kinase si?) 
ND vers 210. 
wn 
o 
D nfr1 nfr5 - no rhizobia 25] | 
. 3 ; 
Zz Z . & 
07 co) or cnoo- ‘—CKD- 
0/29 0/11 14/22 
S 
9 s ito} 
oO om o 
> Le LE. 
‘ ~ Zz z 
: 2 
a . Nb vers 
LaG16 
E G 
4 pur Nér1 |{LaG 76} int | BRS Nir5 {GFP ins | NFR5-GFP + + 
mCherry-LaG16 + - 
F no rhizobia mCherry-LaG16m - + 
WT nfr1 nfr5 Input| 
305 b WB: GFP 
ss b Input 
6204 L WB: mCherry  &| 
al 
Be ° & IP: mCherry 
310; o ° WB: GFP 
$ “ : IP: mCh 
= i ° : mCherry 
a a a a 
04 -cam- -m- fF om G -cm WB: mCherry == 
0/29 0/17 9/11 0/15 10/12 0/30 
o ip te tf 
> L L 
> 2 Z 
ao 
E 
S 
LaG16m mCherry 


Fig. 1. Nanobody-driven assembly defines the receptor complex activating organogenesis. (A) Experi- 
mental design for nanobody-induced complex formation to activate symbiotic signaling. (B) Schematic 
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nanobody with no binding to GFP. (G) Coimmunoprecipitation of LaG16- or LaGl6m-tagged mCherry and 
GFP-tagged NFR5 from Nicotiana benthamiana leaf extracts. Anti-GFP and anti-mCherry antibodies were used 
for immunoblotting. LaGl6m is the mutated nanobody with no binding to GFP. In (C) and (F), lowercase 
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Fig. 2. The core symbiotic signaling complex activating organogenesis is conserved. (A) Number of 
nodules formed on Medicago lyk3 nfp double mutant plants expressing the indicated constructs in the 
absence of rhizobia. Nodules were counted 9 weeks after hairy root transformation. Circles represent 
individual plants. Numbers below the boxplots specify number of nodulated plants out of the total number of 
plants. The letter “m” represents the mutated nanobody with no binding to GFP. Lowercase letters 
indicate significant differences between samples [analysis of variance (Kruskal-Wallis) and post-hoc analysis 
(Dunn's test), P < 0.05]. (B) Representative brightfield and YFP fluorescence (transformation control) 
images of Medicago roots, showing the nodulation phenotype in the absence of rhizobia. Scale bar: 1 mm. 
(C) Representative brightfield and YFP fluorescence (transformation control) images of Lotus roots 
expressing NFR1-LaG16 with NFR5-GFP, kinase-dead (kd) NFR1-LaG16 with NFR5-GFP, NFR1-LaG16 with 
SYMRK-GFP or NFR5-LaG16 with SYMRK-GFP. n = total number of plants. Scale bars: 1mm. 


with three bulky tryptophan residues in the 
nanobody paratope. In contrast to functional 
LaG16, mutated LaG16 (LaG16m) lost the abil- 
ity to bind GFP-tagged NFR5 in tobacco pull- 
down assays (Fig. 1G). Furthermore, when 
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LaGi6m is fused to NFR1, it fails to initiate 
signaling leading to organogenesis (Fig. 1F), 
demonstrating that NFR1I-NFR35 receptor as- 
sociation is essential for signal transduction. 
To explore whether signaling is enhanced by 


assembling more receptors, we next coupled 
two LaG16 nanobodies in series to either NFR1 
or NFR5, separated by a 4.0-residue-long (~130 A) 
flexible linker which should sterically allow 
one receptor to associate with two co-receptors 
(Fig. 1F and fig. $2). We see no difference in 
nodule formation when one or two nanobodies 
are used, indicating that a single association 
event between NFR1 and NFR5 receptors is 
sufficient to initiate nodule organogenesis (Fig. 
1F and fig. $2). 


The core signaling complex is conserved 
in legumes 


To understand whether the heteromeric re- 
ceptor complex corresponds to the activated 
receptor state in other legume-rhizobia sym- 
bioses, we performed analogous experiments 
in Medicago truncatula (Medicago), in which 
we associated the two Nod factor receptors 
LYK3 and NFP (9, 16) using nanobodies. We 
observe that a heteromeric LYK3-NFP receptor 
complex is sufficient to initiate organogenesis 
in Medicago in the absence of external signals 
(Fig. 2, A and B), demonstrating that the trig- 
gering event for signal activation leading to 
organogenesis is the association of a conserved 
core receptor complex comprising one NFR1/ 
LYK3 receptor and one NFR5/NFP receptor. 
To understand whether symbiotic signaling is 
exclusively dependent on the physical assem- 
bly of the receptors or if it also requires a 
phosphorylation event, we introduced a point 
mutation in NFR1 impairing its kinase func- 
tion (17). We find that plants expressing the 
kinase-dead NFRI1 fail to form nodules (Fig. 
2C), showing that the protein kinase activity 
mediates downstream signaling. The SYMRK 
receptor containing a malectin-like domain 
and leucine-rich repeats in the ectodomain 
was shown to interact with NFR5 and to in- 
duce spontaneous nodules when overexpressed 
(8). To decipher the composition and func- 
tional requirements of the core complex, we 
tested nanobody-assembled NFR1-NFR5 sig- 
naling in nfri nfrd5 symrk triple mutant plants. 
The transformed roots show no nodulation 
(Fig. 2C), verifying the dependence of signal- 
ing on the common symbiotic pathway and 
confirming the vital downstream function of 
SYMRK. To investigate whether it is possible 
to drive symbiotic signaling from SYMRK in- 
dependently of the formed NFRI-NFR5 com- 
plex, we next used nanobodies to associate 
SYMRK to either NFR1 or NFR5 (Fig. 2C). We 
observe that neither the SYMRK-NFRI nor the 
SYMRK-NFR5 complex can drive organogene- 
sis (Fig. 2C), showing that the essential role of 
SYMRK in nodulation requires either a ligand- 
or nanobody-induced NFRI-NFR5 complex. 
To verify that the activated receptor complex 
operates through the known organogenesis 
pathways, we next explored its dependency 
on cytokinin signaling (79, 20). We find that 
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Fig. 3. Signaling activates 
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expression in WT plants inoculated 
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expressing NFR1-LaG16 and NFR5- 
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expressing GUS. For the stable 
line root hair image, three images 
were overlaid. Scale bars: 0.1 mm. 
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mation. Numbers below the boxplots 
specify number of nodulated 
plants out of the total number of 
plants. In (A) and (C), lowercase 
letters indicate significant differences 
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the activated NFR1I-NFR5 complex is unable 
to initiate organogenesis in the hk] loss-of- 
function cytokinin receptor mutant (Fig. 2C). 
Together, the data show that the NFR1-NFR5 
complex is essential to initiate nodule organo- 
genesis and that the downstream signaling 
cascade requires the symbiotic pathway. 


Activated complex initiates both cortical 
and epidermal programs 


We next investigated the effect of the nanobody- 
induced NFRI-NFR5 complex on the in- 
fection program by inoculating plants with 
Mesorhizobium loti (M. loti) bacteria, the sym- 
biont of Lotus. We monitored infection by 
M. loti through the appearance of pink, 
leghemoglobin-producing nodules on the 
roots. Although an increased number of white 
nodules form on plants expressing the 
nanobody-associated NFR1I-NFR5 complex, we 
see that formation of infected pink nodules is 
reduced compared with plants expressing the 
native receptors (Fig. 3A and fig. S3). This sug- 
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number of plants. 


gests that the activated NFRI-NFR5 complex 
initiates parallel developmental programs for 
both infection and organogenesis. But com- 
pared with organogenesis, infection is ham- 
pered by the presence of the nanobody-driven 
heteromeric NFRI-NFR5 state. We find that 
plants expressing the nanobody-induced com- 
plexes of either NFR1-SYMRK or NFR5-SYMRK 
are efficiently infected by rhizobia (fig. S4), 
suggesting that only the activated NFR1I-NFR5 
complex affects the infection program. Next, 
we investigated whether rhizobia that use an 
intercellular infection mechanism and cir- 
cumvent the epidermal root hair infection 
program can colonize nodules induced by 
the nanobody-mediated receptor complexes. 
For this, we analyzed infection with IRBG74, 
which uses an intercellular, crack-entry mech- 
anism (21, 22). The transgenic nfri nfr5 mutant 
roots form only white, uninfected nodules (fig. 
S5), suggesting that both infection modes are 
impaired when NFR1 and NFR35 are part of a 
nanobody-bound complex. To understand how 


downstream processes are affected by the 
presence of the nanobody-activated receptor 
complex, we created stable Lotus lines ex- 
pressing GFP-tagged NFR5 and LaG16-tagged 
NFRI. The stable lines were generated in the 
nfr1 nfr5 double mutant background with a 
6-glucuronidase (GUS) reporter gene driven 
by the NIN promoter. Looking at expression of 
the transcription factor NZN, which orches- 
trates both the epidermal infection program 
and the cortical organogenesis program, we can 
evaluate the effect of the activated receptor 
state on both programs (23). We observe nodule 
organogenesis independent of rhizobia 7 to 
10 days after germination in three independent 
stable lines. GUS staining reveals NIN activa- 
tion in nodules as well as in dividing cortical 
cells, which mark sites of emerging nodule 
primordia (Fig. 3B). This cortical NZN activa- 
tion is very similar to the native condition ob- 
served for wild-type (WT) plants inoculated 
with M. loti (Fig. 3B). In the wild type, we see 
that the epidermal infection program is tightly 
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Fig. 4. Barley receptors can A 
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NFR1- and NFR5-type receptors, 
espectively. (B) Structural 
modelling of barley RLK4 and 
RLK10. The amino acid sequence 
identities compared with the intra- 
cellular parts of Lotus NFR1 and 
NFR6 are indicated. Superposition 
of the modelled ectodomains of 
barley RLK4 and RLK10 compared 
with the crystal structures of 
Medicago LYK3 (PDB-ID: 6XWE) 
and NFP (PDB-ID: 7AU7), respec- 
tively. C-o root-mean-square 
deviations (RMSD) are indicated 
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NFR1 


regulated, as NIN is expressed only in a few 
epidermal root hair cells. By contrast, plants 
expressing the nanobody-driven NFR1-NFR5 
complex show broad NIN expression in nu- 
merous epidermal cells along the root (Fig. 
3B). This wider, less-regulated epidermal NIN 
activation might interfere with the coordi- 
nated timing of organogenesis and infection. 
We quantified infection efficiencies by com- 
paring WT plants with the stable Lotus lines 
expressing the NFRI-NFR5 complex 10 days 
after inoculation with M. loti. We observe a 
strong reduction in the number of infection 
threads formed when nanobodies are used to 
constitutively activate NFRI-NFR5 (Fig. 3D). 
Because both organogenesis and infection are 
initiated in transgenic roots expressing the 
nanobody-driven receptor complex, we took 
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the opportunity to ask whether certain re- 
ceptor domains contribute differently to the 
two developmental processes. For this, we en- 
gineered nanobody-mediated receptor com- 
plexes lacking the ectodomains of NFR1, NFR5, 
or both and expressed them in transgenic Lotus 
roots. We find that all ectodomain-lacking 
versions of the complexes are still able to ac- 
tivate organogenesis, albeit with a reduced 
efficiency compared with full-length receptors 
(Fig. 3C). By contrast, removal of any ectodo- 
mains results in complete loss of infection by 
Mz. loti as no infected pink nodules appear 
(Fig. 3A). These data show that the infection 
program is dependent on functional Nod factor- 
recognizing ectodomains, whereas organoge- 
nesis signaling can be mediated solely from 
the intracellular NFR1I-NFR5 domains. Fur- 


CERK6 
NFR5 


thermore, our findings demonstrate that it is 
possible to bypass the ligand-induced signal- 
ing function conferred by the ectodomains to 
induce the cortical organogenesis program. 


Cereal receptors can activate root 
nodule organogenesis 


The ability to develop nitrogen-fixing root nod- 
ules is restricted to a single clade of flowering 
plants (24). However, most plants, including 
cereals, can form arbuscular mycorrhizal sym- 
biosis by activating the common symbiosis 
pathway that is shared with root nodule sym- 
biosis in legumes (25, 26). This dichotomy in- 
dicates that the LysM receptors from cereals 
are unable to activate downstream signaling 
leading to nodulation. To test this, we analyzed 
the barley genome and found seven LysM 
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receptor-like kinases (RLKs) belonging to the 
NFRI1 and NFR5 families (Fig. 4A). Phylo- 
genomic analysis and structural modeling of 
these barley RLKs revealed that RLK4 and 
RLK10 show the greatest similarity to NFR1/ 
LYK3 and NFR5/NFP, respectively (Fig. 4, A 
and B, and fig. S8) (ZI, 12, 27). We first tested 
the ability of these individual barley RLKs to 
functionally complement the nfri and njfr5 
single or nfr1 nfr5 double mutants when ex- 
pressed in Lotus from the native Nfri and 
Nfr5 promoters and terminators. Despite 
the similarity, RLK4 and RLK10 are unable 
to functionally complement NFR1 and NFR5 
(fig. S6 and table S1). We then asked whether 
nanobody-driven association of Lotus NFR5 
with barley RLK4 expressed from the native 
Nfr1 promoter and terminator induces nod- 
ule formation in nfril nfr5 double mutant 
plants. We observe that the RLK4-NFR5 com- 
plex leads to nodule organogenesis indepen- 
dent of rhizobia (Fig. 4C). Likewise, nodules 
form when barley RLK10 is in a nanobody- 
mediated complex with Lotus NFR1 (Fig. 4C). 
Moreover, we find that the barley RLK4-RLK10 
receptor complex associated through nano- 
bodies leads to nodule organogenesis inde- 
pendent of the Nod factor receptors in both 
Lotus and Medicago (Fig. 4, C and D, and fig. $7). 
Barley RLK4 and RLK10 fail to support infec- 
tion as no pink nodules appear when plants 
are inoculated with M. loti (fig. S6). Together, 
these results demonstrate that LysM receptors 
from barley—which diverged from Lotws 200 mil- 
lion years ago (28)—have retained the ability 
to activate the symbiotic pathway leading to 
root nodule organogenesis in legumes. We 
next investigated whether the Lotus chitin 
receptor CERK6 (29)—the closest homolog to 
Lotus NFR1 and in the same receptor family as 
barley RLK4—can support nodulation. Lotus 
CERK6 shares 62% sequence identity with 
Lotus NFR1 (table $1), but only 2 out of 27 plants 
form a single nodule when CERK6 and NFR5 
are associated through nanobodies in the n/r1 
nfr5 mutants (Fig. 4C). Because RLK4 shares 
only 51% sequence identity with Lotws NFR1 
(table S1), signal activation cannot be solely 
explained by sequence similarity. Although 
the molecular details that enable certain re- 
ceptors to activate organogenesis remain elu- 
sive, these results show that signaling specificity 
is ensured by receptor complex composition. 
In this study, we developed a synthetic ap- 
proach to investigate the composition and 
activation of multicomponent and branched 
receptor systems. Our nanobody technique 
for driving complex formation, together with 
other approaches such as chemical control 
dimerization (30), will be applicable when 
studying other cell surface receptor systems. 
We used the nanobody-based approach to 
address a longstanding question in the field 
of plant-microbe interactions: namely, what 
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constitutes an activated LysM receptor sig- 
naling complex. We demonstrate that NFR1 
and NFR5 form the core receptor complex 
initiating root nodule symbiosis in Lotus. Our 
data match earlier findings showing that root 
hair deformation and calcium influx as the 
initial responses to rhizobia are maintained 
in the symrk mutant but abolished in the 
nfri and nfr5 mutants (13, 31, 32), suggesting 
that the NFR1-NFR5 complex has a signaling 
function independent of SYMRK. The data 
support that the first critical event for signal 
activation involves assembly of a core NFRI- 
NFR5 complex, which then interacts with 
SYMRK. We show that although the hetero- 
meric NFR1-NFR5 complex activates the epi- 
dermal program essential for infection by 
rhizobia, nanobody-driven activation leads 
to inefficient colonization by rhizobia. A tight 
spatial and temporal synchronization be- 
tween the infection program and the nodule 
organogenesis program is essential for effi- 
cient root nodule symbiosis. NIN is critical for 
both infection and organogenesis and con- 
tinuous signaling from NFR1-NFR5 through 
nanobody activation and misregulation of 
NIN likely compromises the synchronization 
of bacterial attachment, root hair curling, and 
infection thread progression with the forma- 
tion of nodule primordia leading to inefficient 
infection. Our data show that the intracellular 
kinase domains are necessary and sufficient 
for organogenesis signaling when artificially 
assembled using nanobodies whereas func- 
tional receptor ectodomains are important 
for infection. The main function of the NFR1 
and NFR5 ectodomains is to perceive Nod 
factors, which naturally allow synchroniza- 
tion between receptor complex activation 
and infection and organogenesis. The ecto- 
domains may serve additional functions like 
interaction scaffolds to mark the site of in- 
fection. We demonstrate that barley RLK4 
and RLK10 receptors have the capacity to 
activate symbiotic signaling in legumes, sug- 
gesting that the ability to activate organo- 
genesis is deeply rooted in the LysM receptor 
family, and was co-opted by plants in the 
nitrogen-fixing clade for root nodule symbiosis. 
Evidence for this conserved signaling function 
in barley suggests routes for engineering bi- 
ological nitrogen-fixation into cereals for more 
sustainable food production in the future. 
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QUANTUM SIMULATION 


A superconducting quantum simulator based 
on a photonic-bandgap metamaterial 


Xueyue Zhang**+, Eunjong Kim'?{, Daniel K. Mark?, Soonwon Choi’, Oskar Painter?24* 


Synthesizing many-body quantum systems with various ranges of interactions facilitates the study of 
quantum chaotic dynamics. Such extended interaction range can be enabled by using nonlocal degrees 
of freedom such as photonic modes in an otherwise locally connected structure. Here, we present 

a superconducting quantum simulator in which qubits are connected through an extensible photonic- 
bandgap metamaterial, thus realizing a one-dimensional Bose-Hubbard model with tunable hopping 
range and on-site interaction. Using individual site control and readout, we characterize the statistics of 
measurement outcomes from many-body quench dynamics, which enables in situ Hamiltonian learning. 
Further, the outcome statistics reveal the effect of increased hopping range, showing the predicted 
crossover from integrability to ergodicity. Our work enables the study of emergent randomness from 
chaotic many-body evolution and, more broadly, expands the accessible Hamiltonians for quantum 


simulation using superconducting circuits. 


ealizing a scalable architecture for quan- 

tum computation and simulation is a 

central goal in the field of quantum in- 

formation science. Although architec- 

tures with nearest-neighbor (NN) coupling 
between quantum particles on a lattice are 
prevalent, quantum systems with long-range 
interactions can realize a richer set of com- 
putational tasks and physical phenomena 
(1-4). For instance, in the case of gate-based 
quantum computation, coupling beyond the 
NN level enables nonlocal gate operations be- 
tween qubits, which can reduce the overhead 
of quantum algorithms and lift the restrictions 
on code rate and distance of local-interaction- 
based quantum error-correcting codes (5, 6). 
In the case of analog quantum simulation, the 
inclusion of long-range interactions can alter 
the behavior of otherwise integrable many- 
body systems (7, 8), resulting in quantum chaotic 
dynamics, which is at the root of such topics 
as quantum thermalization (9) and quantum 
information scrambling (JO). Furthermore, 
control over the range of lattice connectivity 
grants access to different physical regimes 
and the crossover between them, such as in 
many-body quantum phase transitions (17-13) 
and the hydrodynamics of nonequilibrium 
quantum states (/4). 

For engineered quantum systems consist- 
ing of interacting quantum particles on a lat- 
tice, it is often challenging to scale to larger 
lattice sizes while maintaining a high degree 
of lattice connectivity and single-site con- 
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trol. One common approach, developed for 
trapped-ion and neutral-atom systems, is to 
use resonant modes of either vibrational (7) 
or optical (15) cavities as a quantum bus for 
mediating interactions between the internal 
states of atoms across the lattice. Similar schemes 
have been adopted in superconducting quantum 
circuits, realizing systems as large as 20 qubits 
with all-to-all coupling via a common microwave 
cavity (16). Increasing the number of lattice 
sites in this case, however, leads to either par- 
asitic coupling arising from dense placement 
of sites in a fixed-volume cavity or frequency- 
crowding effects stemming from the increased 
spectral density of cavity modes when in- 
creasing the cavity size (17). 

An alternative approach for connecting 
quantum particles on a lattice is to construct 
a quantum bus from an intrinsically extensible 
structure, such as a waveguide. Along this di- 
rection, engineered photonic-bandgap wave- 
guides have been proposed as a quantum bus 
that simultaneously protects quantum par- 
ticles from radiative damping through the 
waveguide while allowing for extended-range 
lattice connectivity (18). The waveguide-bus 
concept has been investigated in the context of 
many-body simulation with cold atoms cou- 
pled to engineered nanophotonic waveguides 
(18, 19), and recent experiments have explored 
qubit-photon bound states in superconducting 
quantum circuits with microwave photonic- 
bandgap waveguides (20-24). However, the 
realization of a scalable many-body quantum 
simulator, with single-site quantum-particle 
control and a high level of lattice connectivity, 
has remained an open challenge. 

We demonstrate a scalable many-body quan- 
tum simulator consisting of a one-dimensional 
(1D) lattice of superconducting transmon qubits 
coupled to a common metamaterial wave- 
guide. This system provides both tunable-range 
connectivity between qubits and full single- 


site control and state measurement of individ- 
ual qubits. The waveguide acts both as a bus 
for mediating exponentially decaying long- 
range interactions between qubits and as a 
Purcell filter enabling multiplexed, rapid read- 
out of the qubit states with high fidelity. This 
system realizes an extended version of the 
Bose-Hubbard model with tunable hopping 
range and on-site interaction. Using our abil- 
ity to efficiently collect measurement outcomes 
from many-body quench dynamics—enabled 
by the fast experimental repetition rate of our 
system—we perform direct analysis of out- 
come statistics to learn Hamiltonian parame- 
ters in situ and study the effect of hopping 
range on the evolution of randomness across 
the system. Specifically, we observe a distri- 
bution of outcome bit-string probabilities re- 
flecting the ergodic nature of the Hamiltonian 
with long-range hopping. This result exper- 
imentally confirms the expectation from quan- 
tum chaos for interacting many-particle systems, 
highlighting the connection between ergodic 
unitary dynamics and its effective statistical de- 
scription in terms of random matrix theory (25). 


Metamaterial-based quantum simulator 


The backbone of the many-body quantum sim- 
ulator in this work is a metamaterial waveguide 
formed from a chain of lumped-element inductor- 
capacitor (LC) microwave resonators. The wave- 
guide can be described by a generic model 
(Fig. 1A) of a 1D cavity array with NN coupling 
t (26, 27). The corresponding dispersion re- 
lation (Fig. 1B) is given by ay, = W¢ + 2tcos(kd), 
exhibiting a passband centered around the 
cavity frequency ,. with a bandwidth of 4¢, 
where x is the wave vector and d is the lattice 
constant of the array. The bandgap at frequen- 
cies below We. = We — 2t (above We. + = We + 
2t) is denoted as the lower (upper) bandgap, 
abbreviated as LBG (UBG). Inside the band- 
gaps, the off-resonant coupling between a bare 
quantum emitter and the waveguide modes 
gives rise to an emitter-photon bound state 
(28) whose photonic tail is localized around 
the emitter. Localization follows a spatial pro- 
file (F1)*"e~!*//§ in the LBG or UBG (27), where 
Az is the displacement in the number of unit 
cells from the emitter and €is the localization 
length controlled by the detuning A between 
the band-edge frequency and the transition 
frequency of the bound state. The overlap of 
two bound states results in photon-mediated 
coupling with a range covering multiple unit 
cells, i.e., long-range coupling, exhibiting a 
greater strength and a more extended range 
§ at a smaller detuning |A| (Fig. 1C). 

The metamaterial waveguide consists of a 
42-unit-cell array of capacitively coupled lumped- 
element microwave resonators (Fig. 1, D and 
E) and is equipped at both ends with engi- 
neered tapering sections, designed to reduce 
the impedance mismatch to external 50-ohm 
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Fig. 1. Metamaterial- A 

based quantum simula- t 
tor. (A) Schematic p I 
showing a 1D array of see | é) 
coupled cavities with 

nearest-neighbor coupling 

t. Each cavity is occupied 

by a quantum emitter DD. = 
(orange ball) with coupling 
g to the cavity. (B) Dis- 
persion relation of the 
coupled cavity array in (A) 
with a passband between 
We centered at we = = 
(bandwidth of 4t). The 
LBG (UBG) below (above) 
the passband is shaded in 
green (purple). (C) Top 
(Bottom): Cartoon of two 
emitter-photon bound 
states at small (large) 
detuning |A|, indicated by 
dark (light) orange arrows 
in (B), exhibiting an 


extended (restricted) spatial range and large (small) photonic component in the 
bound states. (D) Electrical circuit realization of (A) with capacitively coupled 
LC resonators and transmon qubits corresponding to the cavity array and 

the quantum emitters, respectively. The coupling capacitors are color coded in 
accordance with (A). (E) Optical micrograph (false colored) of the fabricated 
quantum simulator with 42 metamaterial resonators (lattice constant 

d = 292 um) colored blue connected to input-output ports (red) via tapering 


input-output ports at frequencies lying within 
the passband of the waveguide (24, 29). Each 
of the middle 10 metamaterial resonators (unit 
cells labeled by 7 = 1 to 10) couples to a trans- 
mon qubit (30), which serves as the quantum 
emitter. Individual addressing of each qubit 
is achieved by excitation (XY control) from a 
charge drive line and frequency tuning (Z con- 
trol) from a flux bias line. Dispersive qubit read- 
out is enabled by capacitively coupling each 
qubit Q; to a compact readout resonator R;, 
which itself is then coupled to the metamate- 
rial resonator of the same unit cell. The entire 
metamaterial and transmon qubit system (the 
device) is fabricated by using evaporated thin- 
film aluminum on a high-resistivity silicon 
substrate, with fabrication procedures detailed 
in (21, 29). Further details of the device modeling 
and the experimental setup used to measure 
and test the device are discussed in supple- 
mentary text I and II, respectively. 

This realization of the device enables qubit 
readout using the passband of the metamate- 
rial waveguide with built-in protection against 
Purcell decay channels (supplementary text III). 
The transmission spectrum through the wave- 
guide (Fig. IF) shows a passband ranging from 
We /2n = 5.01GHz to ,+/2n = 7.08 GHz 
with ripples smaller than 8dB near the center. 
The extinction ratio of the transmission be- 
tween that measured in the passband and that 


SCIENCE science.org 


Frequency w 


w 4t 
e= 


—n/d 0 
F Wavevector k 
a 0 
ZS 25 
ie 
2 -50 
a 
E -75 
2-100 RB, R,R, R, R, R, R, R, R, 
£ 40 50 60 7.0 


Frequency (GHz) 


=i) | 


measured in the bandgaps is greater than 
65 dB, with a sharp transition in the trans- 
mission occurring within 100 MHz of the band 
edges. In the middle of the passband, reso- 
nances associated with the readout resonators 
are observed between 5.574 and 6.328 GHz. 
The average decay rate of 10 readout resona- 
tors is KR, /2n = 11.8 MHz, enabling fast, high- 
fidelity multiplexed readout while maintaining 
a low level of readout cross-talk. For details of 
readout methods and characterization, refer 
to supplementary text IV. 


Bose-Hubbard model with long-range hopping 


The spatially extended bound-state excitations, 
formed between transmon-qubit excitations 
and waveguide photons of the metamaterial- 
waveguide bus, creates a lattice of interacting 
microwave photons (37). This quantum system 
is described by an extended version of the 1D 
Bose-Hubbard model with tunable long-range 
hopping and on-site interaction. Specifically, 
each bound state formed from qubit Q;, inher- 
iting the level structure of an anharmonic os- 
cillator from a transmon qubit (30), serves as a 
bosonic site with local site energy €; = o1,;and 
the on-site interaction U; = @»; — oii. Here, 
@o1,; and 2, are the transition frequencies 
of the bound state on site Q; from its ground 
state |0) to the first excited state |1) and that 
from the first to the second excited state |2), 


etry 
amt 
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sections (purple). Ten qubits (Q;, colored orange), controlled by individual charge 
drive lines (pink) and flux bias lines (dark blue), and their readout resonators 

(R;, colored green) couple to the 10 inner unit cells of the metamaterial waveguide 
with a zoomed-in view in the left inset. Detailed view of the coupling region is shown 
in the right inset. Two auxiliary qubits (yellow) are not used in this experiment. 

(F) Transmission spectrum through the metamaterial waveguide (red curve) with 
black arrows indicating the 10 resonances of the readout resonators R;. 


respectively. In addition, the long-range hop- 
ping J;; is enabled by the overlap between a 
pair of qubit-photon bound states on sites Q; 
and Q;. The Hamiltonian of this model that 
captures the basic processes mentioned above 
can be written as 


H/h= 2 Fiabe + ra (Ay — 
rea, (1) 


where bt (;) is the creation (annihilation) 
operator and 7; = bt b; is the number operator 
on site Q;. The parameters of the Hamiltonian 
realized in this simulator can be learned through 
experiments enabled by the precise, single- 
site-level control over qubits. 

We measure the on-site interaction U; (Fig. 2A) 
by performing spectroscopy of 2 after ini- 
tializing Q; in its first excited state |1). From 
within either bandgap, |U;| decreases as Wo; 
approaches the closest band edge owing to 
dressing from the passband modes of the 
metamaterial (22)—i.e., the Lamb shift. In the 
UBG, a wide tuning range of |U;| is achievable 
from the strong hybridization between the 
|1)-|2) transition and the band-edge modes at 
(@o1— We,+)/2n < 300 MHz. The magnitude 
of hopping |.J;;| is measured from vacuum 
Rabi oscillations between sites Q; and Q; by 
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Fig. 2. Hamiltonian learning. (A) On-site interaction |U;| versus frequency wo) 
with measured values indicated by colored circles. (B) Hopping amplitude \Ji| 
versus frequency with experimental data shown as markers (error bars indicate a 
standard deviation). Colors represent the distance |i — j| between sites. Four 
gray-scale arrows specify frequencies in Fig. 3. (€) Localization length & extracted 
by fitting the exponential decay of measured hopping rate [results of polynomial 
fitting of datapoints in (B)] as a function of distance |i —j| (insets) at a few 
different frequencies. The darkness of a marker matches a fitting curve in the 
inset at the same frequency. In (A) to (C), green (purple) shading on the left 
(right) corresponds to LBG (UBG), and theory curves (solid) are obtained from 
numerical calculations using an identical circuit model. (D) Left (Right): Cartoon 
illustrating hopping from a site inside the LBG (UBG) with positive or negative 
sign represented in red or blue and the amplitude represented by opacity, with 
alternating (all positive) signs denoted as + — +— (++-++). (E) Pulse 
sequence for many-body evolution. Ten sites, Q to Qio, are initialized (X gate) in a 
bit-string Zine at their idle frequencies, then tuned to resonance for time t during 


the interaction stage, followed by a site-resolved single-shot readout at their idle 
frequencies to obtain a final bit-string z. (F) Many-body fidelity estimator F, at 
01 /2n = 4.72 GHz versus evolution time t. The Fy curves assume J's from two- 
qubit measurement indicated by the dashed line in (B) with alternating signs 
(orange) and all positive signs (green), and from the numerical optimization 
(blue). The shading corresponds to a standard deviation for 40 randomly 
chosen Zinit’s in the five-excitation sector. Inset: Fy at t = 0.6 us versus J78 
with optimized parameters on the remaining J;;'s, where the brown and the black 
dashed lines indicate J7 extracted from (B) and the numerical optimization, 
respectively. (G) Comparison of |J;,| interpolated from two-qubit measurements 
(colored circles) and from numerical optimization (blue triangles with error bars 
for 68% confidence interval), corresponding to the orange and the blue curves in 
(F), respectively. All nearest-neighbor hopping Wii's are shown (details in the 
inset), whereas for larger hopping distances |i — j| > 1 only the average values are 
indicated. The navy stars represent the maximum Fy at t = 0.6 us in (F) and the 
optimized J7¢ in the insets of (F) and (G). 


initializing one site with az-pulse and tuning 
@o; Of both sites on resonance for a duration 
t with fast flux pulses. For a fixed distance 
|i —J|, |Jij| increases with a decreasing |A|, 
resulting from larger photonic components 
of the bound states (Fig. 2B). Compared to 
the LBG, the UBG exhibits larger |J;,;| at the 
same |A|, owing to a stronger coupling g of the 
bare qubits to the metamaterial at higher fre- 
quencies and the breakdown of the tight-binding 
cavity array model in the circuit realization 
(Fig. 1D; also see supplementary text I). At a 
specific wg, | J; | decreases exponentially as a 
function of distance |i —j|, resembling the 
profile of the photonic tail in a qubit-photon 
bound state (Fig. 2C). From fitting the expo- 
nential decay curve we extract the localization 
length & which ranges from = 1.4:to 4.2, with 
the largest localization length occurring at 
the smallest achievable band-edge detunings. 
For even smaller detunings, the eigenstate of 
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the two interacting bound states merges into 
the passband and becomes radiative to the 
waveguide. 


Many-body Hamiltonian learning 


Beyond the above single- and two-qubit mea- 
surements, we perform in situ many-body 
characterization of Hamiltonian parameters 
(32, 33), which are otherwise hard to access. 
For example, the sign of the hopping term J; ; 
inherits the spatial profile of the photonic 
component of the bound states. In the case of 
the bound states in the UBG, the sign of the 
hopping terms are all uniform (positive), where- 
as for bound states in the LBG, the hopping 
terms alternate sign as the distance between 
lattice sites increases by one (Fig. 2D). This is 
due to the photonic component of the bound 
state behaving as a defect mode inside the 
bandgap, exhibiting a spatial profile resem- 
bling the wave vector at the nearest band edge 


(& = 0 at the upper band-edge and k = n/d at 
the lower band edge). Although insignificant 
in measurements involving only two lattice 
sites, the sign of the hopping terms does alter 
the many-body dynamics of the system. Here, 
we use a many-body fidelity estimator Fz pro- 
posed in (33) to reveal this information. This 
fidelity estimator, which closely tracks the true 
many-body fidelity, is obtained for ergodic 
quench evolution of simple initial states (sup- 
plementary text VI). 

We follow the sequence described in Fig. 2E 
to perform the many-body quench evolution. 
The sequence consists of preparing a set of five 
randomly chosen sites in their first excited 
state, followed by using flux pulses to align mo; of 
all 10 sites for time t, and then finally per- 
forming site-resolved single-shot measure- 
ment on all lattice sites to obtain a 10-bit string 
& = NNy***No. The many-body fidelity estima- 
tor Fg is calculated by comparing bit-string 
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Fig. 3. Two-particle quantum A 10 ee B 

walk with increasing hopping v5 ao. 

range. (A) Evolution of the popu- om | 4.55 GHz 
eee ; S "3 —4.72 GHz 

lation (fj) on sites Q; to Qio © —480 GHz 

as a function of normalized evolu- ~? wn 

tion time Jij,1t. The system is Ls 

initialized in Zinit = 0000110000 

and the evolution occurs at " 

ie crciee  ,  e 

Eee ell a Site Site Site Site , 


tion times of 904, 781, 430, 
and 200 ns from left to right. (B) The second momen 


t tw as a function of normalized evolution time Jjj.17. 


Results calculated from the data in (A) are shown as solid curves with gray scales corresponding to frames 
in (A) and arrows in Fig. 2B. Result from numerical simulation of the integrable Hamiltonian is shown as 


the dotted curve, and u5 for a generic ergodic system 


statistics of repeated measurements with nu- 
merical simulation of the evolution assuming 
a set of Hamiltonian parameters in Eq. 1. The 
maximum Fy is achieved at the parameter val- 
ues closest to the Hamiltonian realized in the 
experiment. The fast repetition rate of this ex- 
periment enables us to perform a large number 
of measurements (1.6 x 10° in total), reducing 
statistical error and increasing sensitivity to 
small Hamiltonian parameter variations (see 
supplementary text V for details of qubit con- 
trol, pulse sequence, and repetition rate). 

We compare Fy at @o1/2n = 4.72 GHz using 
three different parameter sets for J;; in Fig. 
2F: a first set with amplitudes derived from the 
two-qubit experiments in Fig. 2B assuming al- 
ternating signs (Fig. 2D, left); a second set 
with the same amplitudes as the first but all 
positive signs (Fig. 2D, right); and a third 
set of optimized parameter values that maxi- 
mize Fy. The optimized hopping terms are 
restricted to be real-valued, with independent 
Jiis1 for each 7 = 1 to 9 and J;; for each dis- 
tance |i —j| > 1 (all qubit pairs of the same 
distance having the same J;;). An alternating 
sign of J;; with distance is favored, yielding 
a higher many-body fidelity compared to hop- 
ping terms with all positive signs. This is fur- 
ther evidenced by the alternating signs of the 
resulting optimized parameter set. Although 
we find small differences between the set of 
optimized hopping amplitudes and those 
from the two-qubit experiments with alter- 
nating signs (Fig. 2G), Fy of the optimized 
parameter set is markedly better. The sensitivity 
of Fy to the hopping terms is highlighted in 
the inset of Fig. 2F, where the variation of the 
fidelity versus J7 is shown. For details of the 
Fy calculation and parameter optimization, 
refer to supplementary text VI. 


Ergodic many-body dynamics with 

long-range hopping 

We now use the platform to study the effect of 
long-range hopping on the many-body dynam- 
ics. Specifically, the ergodicity of the 1D Bose- 


is indicated by the red dashed line. 


depends on the range of hopping, which ex- 
hibits integrable behavior with NN hopping, 
and chaotic behavior with long-range hopping. 
We study this crossover with various hopping 
ranges and investigate the resulting dynamics 
using both conventional one- and two-site cor- 
relators, and the statistics of the global bit- 
strings resulting from qubit-state measurement 
outcomes across the lattice. This latter tech- 
nique is particularly useful in identifying uni- 
versal signatures of ergodicity and the effect of 
decoherence at long evolution times. The cross- 
over between integrable and ergodic dynamics 
can be qualitatively visualized by a two-particle 
quantum walk (34-36) with initial excitations 
on sites Q;, and Q, using the sequence shown in 
Fig. 2E. The measured quantum walk at a few 
different a ;’s indicated by arrows in Fig. 2B is 
shown (Fig. 3A) as a function of normalized 
evolution time J;;,11, where Jj;,; is the av- 
erage NN hopping rate (the corresponding 
numerical simulations are provided in sup- 
plementary text VII, showing that the quan- 
tum walk patterns are not visibly affected 
by decoherence). The excitation wave packets 
smear over the system when (po; is close to the 
band-edge frequency. More quantitatively, 
this trend can be probed by computing the 
probability p. of measuring a certain bit-string 
g in the two-excitation sector at evolution 
time t. For a generic ergodic Hamiltonian, 
the second moment pty = SOP: (25), which re- 
flects the probability fluctuations, converges 
to u5 = 2/(D +1) after initial evolution (33) 
owing to the chaotic nature of its quantum dy- 
namics (D = 45 is the dimension of the two- 
excitation Hilbert space). No such convergence 
is expected in an integrable Hamiltonian owing 
to revivals associated with ballistic propaga- 
tion of wave packets. As an example, we show in 
Fig. 3B the results from the spin-1/2 XY model 
obtained from modifying the Hamiltonian in 
Eq. 1 by keeping only NN hopping terms in the 
hardcore limit. When @ , is closer to the band 
edge, the measured second moment deviates 
from the simulated integrable result and con- 


Hubbard model in the hardcore limit (U/J| > 1) 
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verges to 5 at an earlier normalized evolution 


time Jiii17 consistent with the breaking of 
integrability due to the extended hopping 
range. With |U/J| > 36 for all the measure- 
ments illustrated in Fig. 3, finite on-site in- 
teractions of the Bose-Hubbard model play a 
negligible role in the breaking of integrabil- 
ity (Supplementary text VIII). 

To further probe this ergodic nature of 
Hamiltonian with long-range hopping, we 
use the experimental evolution at 9/21 = 
4.72 GHz as an example. At a short time (t = 
16 ns), the excitations remain in their initial 
sites. This is visualized for a quantum walk 
with initial excitations on sites Q; and Q, in 
the left panel of Fig. 4C (evolution of popula- 
tion (7;)) and in the bottom left panel of Fig. 
4D (two-site correlator (7;7;)). The histogram 
P(pz) of experimentally measured bit-string 
probabilities {p. } at this early evolution stage 
(Fig. 4B, left) shows a distribution with a tail 
of large p, values, giving a large t. (Experiment 
curve in Fig. 4A). This is associated with an 
insufficient scrambling of the initially local- 
ized quantum information. At an intermediate 
time (t = 360 ns), the excitations are more 
spread out over the entire 1D lattice (Fig. 4D, 
middle left panel), forming a “speckle” pattern 
with site-to-site fluctuation that is associated 
with quantum interference. The quantitative 
signatures of this speckle pattern manifest 
in the histogram P(p.,) following the Porter- 
Thomas (PT) distribution (37) (Fig. 4B, mid- 
dle) and in the second moment i, settling to 
the ergodic value u5. The PT distribution re- 
sults from the randomness in the distribution 
of wavefunction magnitudes, which is pre- 
dicted by Berry’s conjecture (38) stating that 
the single-particle eigenstates of a chaotic sys- 
tem behave like random superpositions of 
plane waves. Similarly, in the many-body set- 
tings, the distribution of wave function mag- 
nitudes across basis states also follows the PT 
distribution. Our observation is the first ex- 
perimental verification of this many-body ver- 
sion of Berry's conjecture in a Bose-Hubbard 
system, whose extension in the thermodynamic 
limit provides the modern theory of quantum 
thermalization such as eigenstate thermaliza- 
tion hypothesis (39, 40). This draws a connec- 
tion between quantum many-body chaos and 
random matrix theory, leading to a deeper 
understanding of the randomness in many- 
body dynamics (32). Distinct from the ran- 
domness inherent in random circuits (25, 4D), 
the randomness in our case originates from the 
ergodicity of the time-independent Hamiltonian. 
In contrast to the experimental results, theo- 
retical calculations at the same evolution time 
using the integrable Hamiltonian show aggre- 
gated excitations on a few sites (Fig. 4E, middle) 
and the resulting larger value of , (Integrable 
theory curve in Fig. 4A). This comparison high- 
lights the effect of long-range hopping in prob- 
ing the ergodic regime. 
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Fig. 4. Ergodic many-body dynamics with long-range hopping at 4.72 GHz. 
(A) Second moment uy» as a function of evolution time t in our system from 
the experiment (orange) and the theory with the optimized parameter set 

in Fig. 2G (blue), compared to theoretical predictions of the integrable model 
(green). The shading on each curve corresponds to a standard deviation of the 
mean second moment for 20 randomly chosen initial bit-strings Zjpit in the 
two-particle sector, and the red dashed line represents the ergodic value 5. 
(B) Density histogram P(p,) of the distribution of experimental bit-string 
probabilities {p,} with the 20 initializations Zinit's at evolution times t = 16 ns, 


In addition, we study the impact of decoher- 
ence by juxtaposing the measurement results 
and the decoherence-free theoretical calcula- 
tion using the optimal learned Hamiltonian 
with long-range hopping. Before the evolution 
time of t ~ 1 us, the two cases agree in the sec- 
ond moment u, (Experiment and Theory curves 
in Fig. 4A), the quantum walk population (Fig. 
AC, left and middle), and the two-site correlator 
(Fig. 4D, middle panels), suggesting that these 
results are not affected by decoherence. After a 
long evolution time (t = 5.4 us, larger than the 
averaged Ramsey coherence time 73; = 1.16 us), 
the second moment of the two cases deviates 
from one another, and the experimental speckle 
pattern begins to wash out compared to the 
theoretical modeling (Fig. 4D, top panels). An- 
other probe of the decoherence is the histogram 
P(p,) of the measured bit-string probabilities 
(Fig. 4B, right). Here, the histogram deviates from 
the PT distribution, narrows substantially, and 
approaches a uniform distribution correspond- 
ing to a completely decohered, maximally mixed 
state. Additional numerical simulations of Ly 
and P(p,) for ergodic and integrable systems 
can be found in supplementary text VIII. 


Conclusion and outlook 


Our many-body quantum simulator is based 
on a 1D lattice of transmon qubits connected 
together using a superconducting metamate- 
rial, which exhibits photonic bandgaps that 
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protect qubit-photon bound states from de- 
cay and a transmission passband used for 
high-fidelity multiplexed qubit-state readout. 
Furthermore, the metamaterial plays the role 
of a scalable photonic bus to mediate tunable 
long-range coupling between qubit-photon 
bound states. This system of interacting bound 
states realizes a Bose-Hubbard model with long- 
range hopping. We characterize the system 
using conventional single- and two-qubit mea- 
surements along with a sample-efficient many- 
body Hamiltonian learning protocol. Lastly, 
we study the many-body quench dynamics of 
the system versus the range of the lattice hop- 
ping, revealing the ergodic nature of the ex- 
tended Bose-Hubbard model, as distinct from 
its NN-coupling counterpart. The major chal- 
lenge in probing long-time quantum evolu- 
tion in our experiment is the short Ramsey 
coherence time 7; , = 1.16 ps, limited by flux- 
noise-induced de phasine, Incorporating a 
single refocusing pulse has been shown to 
increase the coherence time to To; ; = 5.64 us 
at the single-qubit level (supplementary text 
Il. The extended quantum evolution times 
enabled by further dynamical decoupling, 
combined with the tunable-range coupling 
investigated in this work, provide valuable 
opportunities to explore nonequilibrium dy- 
namics with or without coupling to an envi- 
ronment and quantum phases of matter in 
the presence of frustration. 


076.0 0.2 % 0.6 0.8 0.0 aa 


% site a, Oe 


0.1 0.15 0.0 Bor 


360 ns, and 5.4 us from left to right [indicated by the dotted lines in (A)]. The 
solid lines show the PT distribution, and the dashed line in the right plot shows 
the value p; = 1/D of a classical uniform distribution. (€) Evolution of the 
population (n;) on sites Q: to Qio as a function of time t with Zinit = 0000110000 
in the cases of experiment, theory, and integrable theory from left to right. The 
white dashed lines at the bottom (in the middle) indicate t = 16 ns (360 ns). 
(D and E) Two-site correlator (f;f;) with Zinit = 0000110000 at evolution times 
t= 16 ns, 360 ns, and 5.4 us from bottom to top in the cases of experiment 
[left column of (D)], theory [right column of (D)], and integrable theory (E). 
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ZEOLITES 


A 3D extra-large-pore zeolite enabled by 1D-to-3D 
topotactic condensation of a chain silicate 


Jian Li+?:3*++, Zihao Rei Gao*}, Qing-Fang Lin®*+, Chenxu Liu®, Fangxin Gao®, Cong Lin?*”, 
Siyao Zhang®, Hua Deng®, Alvaro Mayoral®”°, Wei Fan”, Song Luo”, Xiaobo Chen’”, Hong He®?°, 


Miguel A. Camblor**, Fei-Jian Chen®®*, Jihong YuS* 


Zeolites are microporous silicates with a large variety of applications as catalysts, adsorbents, and cation 
exchangers. Stable silica-based zeolites with increased porosity are in demand to allow adsorption and 


processing of large molecules but challenge our synthetic ability. We report a new, highly stable pure silica 
zeolite called ZEO-3, which has a multidimensional, interconnected system of extra-large pores open through 
windows made by 16 and 14 silicate tetrahedra, the least dense polymorph of silica known so far. This zeolite 
was formed by an unprecedented one-dimensional to three-dimensional (1D-to-3D) topotactic condensation 
of a chain silicate. With a specific surface area of more than 1000 square meters per gram, ZEO-3 showed a 
high performance for volatile organic compound abatement and recovery compared with other zeolites and 


metal-organic frameworks. 


he size of the molecules that can enter, 
diffuse, and react into zeolites is limited 
by the size of their pores (J-4), which are 
typically described as being “of molecu- 
lar dimensions,” meaning that they are 
the size of small molecules (<7 A). For many 
applications, small pores enhance reaction 
and sorption selectivity (5), but for other ap- 
plications such as processing large molecules 
from petroleum or sorption and reaction of 
organic pollutants, stable zeolites with larger 
pores are in demand (6). Natural and synthe- 
tic zeolites have a fully connected, three- 
dimensional (3D) network of corner-sharing 


SiO, tetrahedra that are tectosilicates or frame- 
work silicates (7), with Si occasionally substi- 
tuted by other atoms. However, some zeolites 
are obtained in the form of 2D precursors 
(phyllosilicates or layered silicates) (7) that 
only become fully connected tectosilicate zeo- 
lites by condensation of their layers through a 
calcination procedure that is “topotactic” be- 
cause it does not alter the layer topology (8-10). 
The condensing layers can be obtained by direct 
synthesis or by the disassembly of certain 
zeolites, as in the so-called ADOR (assembly- 
disassembly-organization-reassembly) pro- 
cess (11). 


After several decades of extensive and sys- 
tematic zeolite synthesis studies (12), there 
have still been no reported examples or pre- 
dictions of a 3D zeolite obtained by conden- 
sation from a 1D precursor, either directly 
synthesized or obtained by disassembly of 
another zeolite. We report such a 1D-to-3D 
topotactic condensation from ZEO-2, a directly 
synthesized complex “zeolitic” chain silicate, into 
ZEO-3, a fully connected 3D extra-large-pore 
zeolite (“ZEO-n” refers to materials discovered 
and patented by the Anhui ZEO New Material 
Technology Co., China). This condensation 
does not alter the topology of the chain sili- 
cate, so it is topotactic. The resulting stable 
zeolite ZEO-3 exhibits very low density, a multi- 
dimensional system of interconnected extra- 
large pores (Fig. 1), and the presence in its 
structure of double four-membered ring units 
(D4Rs), which are small cubes of silica. For 
pure silica zeolites, this kind of unit is strained 
and up to now was believed to need a F anion 
near its center to be accessible for crystalliza- 
tion (13) because it has never been seen before 
in a silica zeolite synthesized without the use 
of F anions. 


Synthesis and structure of the precursor 
chain silicate 


We synthesized the complex chain silica zeo- 
lite precursor ZEO-2 using tricyclohexylmethyl- 
phosphonium (C,9H3¢P*, tCyMP) as an organic 
structure-directing agent (OSDA) from a gel 
of molar composition 1 SiOz: 0.5 tCyMPOH: 


10 H,O heated at 175°C (see the supplementary 
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materials). The structure of ZEO-2 was suc- 
cessfully solved ab initio by using eight con- 
tinuous rotation electron diffraction (CRED) 
(14) datasets. The pure silicate ZEO-2 crystals 
display a needle-like morphology (fig. SIA) 
and have a C-centered monoclinic cell with a = 
23.54.65(7) A, b = 24.7446(7) A, c = 14.4024(4) A, 
and B = 115.1974(9)° (tables S1 and S2 and fig. 
$2). ZEO-2 is a complex 1D chain silicate dec- 
orated with silanol and silanolate groups 
(Fig. 2A) that hold the structure together 
through numerous H bonds between adjacent 
chains (Fig. 2, B and C), with the tCyMP 
cations located in the interchain space (fig. 
S5). The cations were occluded intact, as dem- 
onstrated by ’C and ?’P nuclear magnetic 
resonance (NMR) (fig. S6), and amount to 
8.85 OSDA per unit cell according to C analy- 
sis (25.0 wt %). H bonds were observed in the 
*H magic-angle spinning (MAS) NMR spectrum 
as a broad resonance around 15.1 parts per 
million (ppm) (fig. S7), indicating a moder- 
ate to strong H bond (/5) corresponding to 
O---O distances (16) of ~2.51 A, in good agree- 
ment with the crystallographic distances of 
2.47 to 2.52 A. 

The 1D pure silica chains in ZEO-2 are 
aligned along the [001] direction (Fig. 2A) and 
surrounded by four identical chains in the ab 
plane (Fig. 2C). At the edge of the ZEO-2 chain, 
four silanols or silanolates form a single four- 
membered ring (S4R) that face, slightly dis- 
placed, an identical S4R from the next chain, 
with H bonding along the [110] and [1-10] 
directions connecting adjacent chains (Fig. 
2B). The high resolution of the ?°Si MAS NMR 
spectrum of ZEO-2 (Fig. 2D) reveals four Q? 
Si sites (-94.2, -95.8, -98.6, and -100.4 ppm) 
spanning a chemical shift range unprece- 
dented for Q? in zeolites, which are more 
typically centered at around -102 + 1 ppm. 
However, the values are well within the gen- 
eral Q? range in silicates (17). The spectrum 
also shows seven Q‘* Si sites (from -106.8 to 
-116.8 ppm) and is thus in good agreement 
with the crystallographic results (4 Q® and 
5 Q* all with the same multiplicity plus 2 Q* 
with half multiplicity; table S5). ?°Si{‘H} cross- 
polarization (CP) MAS NMR spectroscopy 
proved the existence of those four Q? Si sites in 
ZEO-2 (Fig. 2D, top). 


Condensation of 1D chains to a 3D 
extra-large-pore zeolite 


Upon calcination in air to remove the OSDA 
(600°C, 3-hour ramp, 6-hour plateau), silanol 
groups in adjacent chains condensed into 
Si-O-Si bridges with H,O elimination, result- 
ing in the pure silica zeolite ZEO-3 (Fig. 3, A 
to C), which maintained the needle-like mor- 
phology (fig. S1B). The condensation occurred 
between 370 and 390°C (fig. S8), coincident 
with the removal of organics (fig. $15). Phos- 
phorus residues were eliminated by washing 
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16MR 


B.51A 
10.36 A 
1am 


16MR 


8.00 A 
979A 


14MR 


Fig. 1. The extra-large-pore system in ZEO-3. Shown is the 3D system of the interconnected extra- 
large-pore system (left) and the crystallographic pore size of ZEO-3 (right). The van der Waals radius of 


O (2 x 1.35 A) has been subtracted. 


Fig. 2. Structure of the chain silicate ZEO-2. Only O atoms related to the subsequent condensation 
reaction are shown (small red spheres). Silicon atoms are shown as blue (always Q*) or black (Q° in 
ZEO-2 converting into Q* in ZEO-3). (A to C) A chain of ZEO-2 (A) is H bonded (B) to four adjacent 
chains (C). (D) 29Si MAS NMR spectrum (bottom) showing the resolution of the Q° and Q% silicon sites 
(n = 4 and 7 sites, respectively). The close proximity of Q° sites to H atoms is revealed in the 7°Si{/H} CP 
MAS NMR spectrum by their relative intensity enhanced by polarization transfer from close protons at 
short contact time (top, 1.5 ms). 


with water within an autoclave at 100°C for 
1 day or by reduction with H, (a mixture of 
H,/N, with 10/90 volume ratio) from the as- 
made ZEO-2 sample at 600°C with a 2-hour 
ramp and a 6-hour plateau. The structure of 
ZEO-3 was also solved ab initio by CRED with 
five datasets (tables S2 and S3 and fig. S3). The 
unit cell of ZEO-3 shrank to a = 21.504.6(8) A, 
b = 21.2757(8) A, c = 14.4638(4) A, and B = 
108.7196(1)° but maintained the same sym- 
metry as ZEO-2, as well as the topology of the 
chain. Whereas a 17% contraction of the struc- 


ture occurred along a and 0 axes, the ¢ axis 
underwent only a marginal expansion of 0.4%. 
To obtain more accurate atomic positions, the 
structures of ZEO-2 and ZEO-3, including the 
position of the disordered tCyMP in ZEO-2, 
were subsequently Rietveld refined against 
synchrotron powder x-ray diffraction data 
(SPXRD) (figs. S4 and S5 and tables S4 to S10). 
All of the unit-cell data given above corre- 
spond to the refined structures. The final re- 
fined unit-cell compositions of ZEO-2 and 
ZEO-3 were [Sigo076Ho4]|(CigHa6P)s and SigoOi60, 
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respectively (see the supplementary materials 
for details). 

During thermal treatment, neighboring S4Rs 
in ZEO-2 connect to each other to form a D4R 
(Fig. 3A) by condensation of the terminal Si-OH 
groups, yielding the fully connected framework 
of ZEO-3 (Fig. 3, B and C). The condensed solid 
is a true, noninterrupted, 3D extra-large-pore 
zeolite. The channel system of ZEO-3 is 3D 
with 16- x 14- x 14-membered ring (MR) chan- 
nels (Fig. 3, B and C) and full connectivity be- 
tween channels (Fig. 1). The fulfillment of the 
condensation process is demonstrated by the 
?°Si MAS NMR spectrum of ZEO-3, which re- 
veals total condensation: All Si atoms are Q* 
sites (Fig. 3D, bottom) with a negligible amount 
of Q® that could be assigned to connectivity 
defects, as proven by the very-low-intensity en- 
hancement by cross-polarization (Fig. 3D, mid- 
dle and top) and Fourier transform infrared 
spectra from self-supported pellets (fig. S16). 

The structural models obtained were fully 
corroborated by spherical aberration (Cs)- 
corrected scanning transmission electron mi- 
croscopy (STEM) (Fig. 4 and fig. S10), with 
which a faint signal corresponding to the 
tCyMP (C and P) was also identified between 
the chains of ZEO-2 (Fig. 4A) in the place that 
after condensation will become 14-MR pores 
in ZEO-3. The visualization of ZEO-2 along 
[001] could not be obtained because it was 
hampered by the larger thickness (because 
this is the long needle direction) and the ex- 
istence in the structure of atoms at different 
2 levels that are displaced from one another 
along x and y directions. The 14- and 16-MR 
pores of ZEO-3 are clearly visible (Fig. 4, B 
and C) together with the smaller 4-, 5-, and 
6-MR units in both materials (Fig. 4 and 
fig. S10). 


The o expansion of beta zeolite polymorphs 


The details of the topology are shown in Fig. 5 
and tables S11 and S12. The ZEO-2 chain is 
topologically identical to the one found in 
polymorph B of the beta zeolite, although in 
that zeolite, it is not an isolated chain but 
rather is embedded in the 3D framework. We 
use here the acronym BEB to refer to that 
polymorph, although this is not an accepted 
zeolite topology code. The chain in BEB and 
ZEO-3 is built by a large composite building 
unit (Fig. 5A) that is alternately rotated by 
+90° and -90° (Fig. 5B, dotted arrows). If the 
rotation were performed always in the same 
sense (that is, always by either +90° or -90°; 
Fig. 5C, dotted arrows), then a chiral chain 
would result, which is in fact embedded in 
the chiral BEA polymorph of the beta zeolite 
(space group P4,22 or P4322). Thus, conden- 
sation of such a chain in a way similar to the 
condensation of ZEO-2 would result in anew 
chiral 16- x 14- x 14-MR hypothetical zeolite. 
ZEO-3 (Fig. 5D) and this hypothetical chiral 
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Fig. 3. 1D-to-3D topotactic condensation into the extra-large-pore framework silicate ZEO-3. 

(A to C) During calcination of ZEO-2, condensation of Q° sites through dehydroxylation connects two S4Rs 
to make a DAR (A), through which each chain is bonded to four adjacent chains, resulting in the extra- 
large-pore ZEO-3 with 14-MR (B) and 16-MR (C) channels. (D) The corresponding 295i MAS NMR spectra 
(bottom) are dominated by Q* sites with almost no Q° defects and thus little intensity enhancement in the 
29Si{H} CP MAS NMR under short (middle, 1.5 ms) or long (top, 6 ms) contact times. 


Fig. 4. Cs-corrected STEM visualization of the 1D silicate chain ZEO-2 and the 3D zeolite ZEO-3. 

(A to €) ZEO-2 along [110] projection (A) and ZEO-3 showing the extra-large 14-MR along the [110] projection 
(B) and the 16-MR along the [001] projection (C). All other rings (4-, 5-, and 6-MRs) are clearly visible. 

To facilitate image interpretation, the schematic models (Si atoms are shown in blue and O atoms in red) have 
been superimposed, and the simulated STEM images are also included in the right bottom inset. 


zeolite (Fig. 5E) correspond to the o expan- 
sion of polymorphs BEB (Fig. 5F) and BEA 
(Fig. 5G), respectively, of the beta zeolite 
(18, 19). A minimization of both structures 
using the general utility lattice program (GULP) 
suggests that the energies of both ZEO-3 and 
o-BEA are close to the energy-density relation- 
ship normally encountered in zeolites (fig. 
S17 and table S14). Because ZEO-3 has been 
realized, we believe that the hypothetical extra- 
large-pore o-BEA zeolite might be a reasonable 
target for future studies. 


Properties of ZEO-3 


ZEO-3 is a stable, fully connected silicate zeo- 
lite containing 3D-interconnected pores opened 
only through extra-large windows. The crys- 
tallographic pore sizes of ZEO-3 are 10.36 x 
8.51 A and 9.79 x 8.00 A for the 16- and 14-MR 
pores, respectively (Fig. 1). The 3D extra-large- 
pore nature of ZEO-3 resulted in a very low 
framework density (FD) value (12.76 tetrahe- 
dral atoms per 1000 A*). Compared with the 
other known stable, low-density (alumino) 


silicate zeolites, including FAU, EMT, *BEA, 
BEC, ISV, and IWV and the recently re- 
ported PST-2, PST-32 (20), and ZEO-1 (6), this 
value is the lowest and puts ZEO-3 as the crys- 
talline silica polymorph with the most open 
framework (table $13). The calculated density 
of ZEO-3 is just 1.27 g/cm, less than half the 
density of quartz (2.65 g/cm?) and near that of 
water. In fact, ZEO-3 breaks the observed ten- 
dency between the framework density and the 
size of the smallest rings in the zeolite struc- 
ture (27). For an average smallest ring of 4.25, 
the predicted minimum FD (27) is 13 T atoms 
per 1000 A®, which is greater than the value 
for ZEO-3. Compared with the real values 
of noninterrupted zeolites containing 4- and 
5-MRs, ZEO-3 is well below the lowest cal- 
culated FD of ISV and IWV (15.0 T-atoms 
per 1000 A’). 

The observed N, and Ar adsorption/desorption 
isotherms (type Ia) of ZEO-3 revealed high spe- 
cific surface areas of 989 and 1032 m?/g (figs. 
S11 and S12), respectively. The nonlocal den- 
sity functional theory method applied to the 
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BEA chain 


Fig. 5. Topology of ZEO-3 and a hypothetical chiral extra-large-pore zeolite. 
(A) Composite building units: the small units and the large unit made from them. 
(B and C) ZEO-2 chain (B) and BEA chain (C) built by attaching successive 

large units, viewed along two different directions. (D to G) The tiling structures 

of ZEO-3 (D), the o-expanded BEA (E), and the polymorphs BEB (F) and BEA (G) 
of the beta zeolite. Dotted arrows indicate topologically identical large units (units 1, 


Ar adsorption data calculated mean pore sizes 
of 10.8 and 8.8 A (fig. $13), which match well 
with the crystallographic results. The extra- 
large pores of ZEO-3 allowed the diffusion and 
adsorption of large molecules such as Nile 
blue (fig. S14), suggesting the potential for the 
removal of large organic pollutants from waste 
liquid streams. 

Adsorption has been considered as an energy- 
saving candidate for volatile organic com- 
pound (VOC) abatement and recovery (22). 
The development of sorbents with high ad- 
sorption capacity, water vapor resistance, and 
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o-BEA (P4,22 or P4,22) 


easy regeneration is critical for a successful ad- 
sorption technology (23). Zeolites are among 
the best adsorbents for VOCs given their dis- 
tinctive microporosity, high adsorption capacity, 
and nonflammability (23, 24). The adsorption 
equilibrium capacities of toluene and water 
vapor on ZEO-3 were larger than those on beta 
zeolites, which have smaller pores (Fig. 6A). 
ZEO-3 exhibited much longer breakthrough 
time (better dynamic capacity) than beta zeo- 
lites, with little interference from water (Fig. 
6B). The main desorption peak of toluene oc- 
curred at a lower temperature for ZEO-3 than 


BEB (C2/c) 


BEA (P4,22 or P4,22) 


2, 3, and 4) rotated by alternating +90° -90° +90° and -90° in (B), or 
nonalternating +90°, +90° +90° and +90° or -90° -90°, -90° and -90° in (C). In 
(D) to (G), the dotted lines separate neighboring ZEO-2 or BEA chains; in (D) 
and (E), the newly formed D4Rs between neighboring ZEO-2 or BEA chains 

are highlighted in green; in (F) and (G), no D4Rs exist because S4Rs are shared 
between neighboring ZEO-2 or BEA chains. 


for beta (Fig. 6C), making its thermal regen- 
eration easier. Thus, ZEO-3 outperforms beta, 
a reference zeolite for this application (25, 26), 
in terms of adsorption capacity and regeneration 
potential. Comparison of ZEO-3 with commer- 
cial zirconium 1,4-dicarboxybenzene UiO-66, 
one of the most stable metal-organic frame- 
works (MOFs) (27), suggested a similar per- 
formance for the fresh materials (fig. S18). 
However, after recycling, the performance of 
ZEO-3 was maintained, whereas that of UiO-66 
was compromised. At the end of five adsorption 
cycles, the PXRD pattern of UiO-66 revealed 
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Fig. 6. Application of 3D extra-large-pore zeolite ZEO-3 in VOC removal. Shown are VOC adsorption isotherms (A), breakthrough adsorption (B), and desorption 
curves (C) on a ZEO-3 (red) and a beta (black) zeolite. 


that it had been destroyed, whereas ZEO-3 
remained stable (fig. S19). Additionally, the 
desorption temperature of UiO-66 was sig- 
nificantly higher than that of ZEO-3 (fig. 
$20), indicating a poorer regeneration ability. 
Phosphorous-free ZEO-3 had a high thermal 
stability (1100°C 10°C/min, 1-hour plateau) and 
hydrothermal stability (760°C, 10% H,O, 3 hours), 
which may ease its application in real condi- 
tions (fig. S9). It is also possible to introduce 
active sites (such as Ti) into ZEO-3 through a 
one-pot synthesis method (figs. S21 and S22). 
Ultraviolet-visible spectra reveal that Ti-ZEO-3 
exhibits both tetra-coordinated (~210 nm) and 
hexa-coordinated (~270 nm) Ti species (fig. 
$23). The catalytic properties of Ti-ZEO-3 will 
be investigated in a future study. 


Concluding remarks 


The fact that ZEO-3 presents structural fea- 
tures that sharply depart from previous ob- 
servations (silica D4R units without F anions 
and FD lower than predicted) deserves some 
consideration. Those previous observations 
refer to materials directly synthesized by hy- 
drothermal crystallization. The successful syn- 
thesis of ZEO-3 demonstrates that materials 
that may be considered not accessible by 
direct synthesis can be obtained by postsyn- 
thesis transformations. This has also been ob- 
served for zeolites obtained by the ADOR 
process, which afforded “unfeasible” zeolites 
(28) or the hybrid guest-host pure silica STW, 
which was also predicted to be unfeasible by 
direct synthesis (29). This observation allows 
us to foresee new materials developed by 1D- 
to-3D topotactic condensation, as was done 
using the ADOR process. It is worth mention- 
ing that the final step in the ADOR process is a 
2D-to-3D condensation that is similar to the 
1D-to-3D process reported here but produces a 
systematic reduction in pore size with regard 
to the parent material. 
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SOLAR CELLS 


Radical polymeric p-doping and grain modulation for 
stable, efficient perovskite solar modules 
Shuai You"*+, Haipeng Zeng", Yuhang Liu*}, Bing Han+, Min Li, Lin Li’, Xin Zheng’, Rui Guo’, 


Long Luo’, Zhe Li*, Chi Zhang®, Ranran Liu’, Yang Zhao’, Shujing Zhang’, Qi Peng’, Ti Wang’, 
Qi Chen®, Felix T. Eickemeyer’, Brian Carlsen”, Shaik M. Zakeeruddin?, Ligiang Mai®’, 


Yaoguang Rong"°*, Michael Gratzel**, Xiong Li** 


High-quality perovskite light harvesters and robust organic hole extraction layers are essential for 
achieving high-performing perovskite solar cells (PSCs). We introduce a phosphonic acid—functionalized 
fullerene derivative in mixed-cation perovskites as a grain boundary modulator to consolidate the crystal 
structure, which enhances the tolerance of the film against illumination, heat, and moisture. We also 
developed a redox-active radical polymer, poly(oxoammonium salt), that can effectively p-dope the 
hole-transporting material by hole injection and that also mitigates lithium ion diffusion. Power 
conversion efficiencies of 23.5% for 1-square-centimeter mixed-cation-anion PSCs and 21.4% for 
17.1-square-centimeter minimodules were achieved. The PSCs retained 95.5% of their initial efficiencies 
after 3265 hours at maximum power point tracking under continuous 1-sun illumination at 70° + 5°C. 


rganic-inorganic hybrid perovskites have 

attracted extensive research interest for 

photovoltaic (PV) applications (7-3). Al- 

though the power conversion efficiency 

(PCE) of organic-inorganic hybrid perov- 
skite solar cells (PSCs) can now exceed 25% 
(4-7), such high efficiencies have been ob- 
tained only with small-area PSCs (aperture 
areas < 0.1 cm”). Advances reported for small 
laboratory cells do not necessarily transfer 
to perovskite solar modules (PSMs), because 
issues such as film inhomogeneity, resistive 
losses from photocurrent collection, and in- 
stability resulting from ion diffusion escalate 
steeply with device area (8). Thus, larger-area 
devices based on current materials and fabri- 
cation techniques often have substantial per- 
formance losses (9-12), so it is essential to 
deposit compact perovskite light-harvesting 
layers and charge-transporting layers with low 
defect densities over larger areas. 

The antisolvent spin-coating process has 
been widely used to deposit perovskite thin 
films (13-15). However, this technique cannot 
be applied for casting large-area perovskite 
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films. Commonly used approaches to scale up 
device size use “perovskite inks” that are com- 
patible with processes such as slot-die coating, 
doctor-blade coating, and vacuum flash—assisted 
solution processing (VASP) (J6-79). Besides 
morphology control, it is also necessary to pas- 
sivate defects, suppress unfavorable ion diffu- 
sion, and promote electronic charge carrier 
transport within the perovskite films (9, 20, 27). 
Functional molecular modulators were intro- 
duced into the grain boundaries (GBs) to heal 
defects and prevent ion migration (22-29). 
Our group has introduced phosphonic acid de- 
rivatives [-PO(OH).] as more efficient passiva- 
tion and cross-linking agents compared with 
carboxylic acids (26, 27). However, prior work 
generally made use of aliphatic backbones— 
which are electrochemically inert and do not 
interact with perovskite—to connect the func- 
tional groups. Hence, designing a grain mod- 
ulator that can facilitate intergrain electron 
transfer, while strengthening their binding, is 
essential to fabricating high-quality large-area 
perovskite films for PSMs. 

The fabrication of high-performance PSMs 
will also require both shortening the process- 
ing time and increasing the conductivity and 
stability of the hole-transporting layer (HTL). 
For an n-7-p device, typical hole-transporting 
materials (HTMs) such as 2,2'7,7'-tetrakis-(N,N- 
di-p-methoxyphenyl-amine)-9,9'-spirobifluorene 
(spiro-MeOTAD) or poly[bis(4-pheny]l)(2,4,6- 
trimethylphenyl)amine] (PTAA) are deposited 
on top of the perovskite layer. Because of their 
low intrinsic conductivity, a p-type doping pro- 
cess with hygroscopic lithium salts, for exam- 
ple, lithium bis(trifluoromethylsulfonyl)imide 
(LiTFSI), is usually applied. The conventional 
doping process of spiro-MeOTAD typically takes 
hours and results in by-products (e.g., Li,O,) 
that will seriously degrade cell performance 
(30-32). Efforts have been devoted to devel- 


oping doping strategies for HTMs (33) in PSCs. 
Kong et al. doped spiro-MeOTAD by bubbling 
carbon dioxide through HTM precursor solu- 
tion under ultraviolet (UV) light, thereby de- 
creasing the doping time (34). Zhang et al. 
developed a method that avoids post-oxidation 
by using stable organic radicals and ionic salts 
as dopants (35). Although proven effective for 
small-area laboratory cells (36, 37), efficient 
HTM doping for larger-area PSMs has not been 
reported yet. 

We introduce two cooperative strategies to 
address these issues and realize efficient PSMs 
that achieve an unprecedented operational 
stability and, at the same time, can be manufac- 
tured on a large scale at a competitive produc- 
tion rate. We used a redox-active polymeric 
dopant consisting of a polyoxyethylene back- 
bone endowed with 2,2,6,6-tetramethylpiperidin- 
1-yl)oxyl, abbreviated as PPO-TEMPO (molecular 
structure shown in Fig. 1A), as an oxidant 
to afford p-doping of the HTLs through hole 
injection. This strategy not only shortened the 
doping time to a few seconds, which enables 
high production rates for PSMs, but also stab- 
ilized the device by suppressing the Li ion ag- 
gregation and diffusion into the adjacent layers. 
We also synthesized the fullerene derivative 
4-(1',5'- dihydro-1'-methyl-2'H-[5,6] fullereno- 
C60-/,,-[1,9-c]pyrrol-2'-yl) phenylphosphonic 
acid (CPPA; molecular structure shown in Fig. 
1A) to mitigate GB defects of mixed-cation 
perovskites. CPPA features a conductive full- 
erene core functionalized by phosphonic acid 
anchoring groups that can interact with perov- 
skites through hydrogen bonding, Pb coordina- 
tion, and fullerene—halide radical interactions 
(26-29). By self-assembling at the GBs, CPPA 
enhanced the tolerance of the perovskite film 
against illumination, heat, and moisture and 
accelerated intergrain electronic charge carrier 
transport. The combined action of the two agents 
suppresses interfacial ion migration, contribut- 
ing in acrucial manner to the high performance 
of our devices. These approaches allowed us 
to fabricate efficient PSCs and PSMs showing 
outstanding long-term operational stability. 


PSM fabrication and performance 


The PSCs and PSMs we fabricated have the 
layered architecture indium tin oxide (ITO)/ 
tin(IV) oxide (SnO.)/perovskite/HTL/Ag or 
Au (Fig. 1B). The triple-cation perovskite 
CsFAMAPD(I,_,,Br,)3 [where Cs is cesium, FA 
is HC(NH,)s, and MA is CH;NHsg] (38, 39) was 
deposited by the VASP method (see supple- 
mentary materials for device fabrication de- 
tails). The VASP method enables fast removal 
and recuperation of solvent, which boosts the 
rapid crystallization of the dimethyl sulfoxide 
(DMSO)/lead(D) iodide (PbI) Lewis base ad- 
duct intermediate phase (14, 75). Upon ther- 
mal annealing, highly oriented, crystalline 
perovskite films can be obtained (19, 40). This 
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Fig. 1. Device configuration and photovoltaic performance. (A) Molecular 
structures of CPPA, PPO-TEMPO, and PPO-TEMPO(TFSI). (B) Schematic 
outline of the architecture of as-fabricated PSCs, and the stabilizing strategy 
for the perovskite layer and HTL. The perovskite layer had a composition 
Cso.05FAo.g5MAo.10Pb(Io.97Bro.o3)3- (€) Cross-sectional SEM images of 


perovskite/HTL/Ag aged under illumination at 75°C for 
area of the perovskite and HTL films is marked by the d 


method allows deposition of perovskite films 
on a large substrate area and can be applied to 
continuous production processes. Previously, 
we have demonstrated operationally stable 
PSCs using CsFAMA perovskite polycrystal- 
line film and Spiro-MeOTAD/LiTFSI/4-tert- 
butyl pyridine (tBP) blend film (abbreviated as 
Spiro-Li) (39, 40). However, the cells could only 
deliver stable electricity output at a tempera- 
ture below 65°C. When the cells were stressed 
at a higher temperature, cell performance de- 
cayed severely owing to the degradation of the 
Spiro-Li layer. At elevated temperatures, small 
pinholes within the Spiro-Li layer were formed, 


00 hours. The degraded 
lashed line. In both images, 


phological deformation that provided chan- 
nels for the escape of volatile and corrosive 
ingredients from the perovskite layer (41-44). 
Upon introducing the PPO-TEMPO doping 
strategy to the Spiro-Li layer, the bulk morphol- 
ogy of the perovskite/HTL bilayer featuring 
well-defined interfaces was markedly stabi- 
lized under illumination at 75°C for 100 hours 
(Fig. 1C). Time-of-flight secondary ion mass spec- 
trometry (TOF-SIMS) measurements showed 
that PPO-TEMPO doping could also minimize 
the interfacial diffusion of iodide from the 
perovskite layer to the HTL (fig. S1). 

To evaluate the effect of the CPPA modu- 


enlarged, and ultimately caused severe mor- 
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lator and PPO-TEMPO doping on cell per- 


Ag electrodes had been removed for conducting TOF-SIMS measurements of the 
HTL layer (fig. S1). (D) J-V curves and photovoltaic parameters of a representative 
cell that used CsFAMA and Spiro-Li (control), CsFAMA-CPPA and Spiro-Li (CPPA), 
and CsFAMA-CPPA and (Spiro-Li)-PPO-TEMPO (PPO-TEMPO). (E) J-V curves 

and stabilized output of the PSM recorded in the lab using a mask with an aperture 
area of 17.1 cm’. The photovoltaic parameters certified by an accredited testing 
laboratory are also presented for comparison. (F) A photograph of the PSM. 


formance, we fabricated and tested a batch of 
20 cells. The representative current density- 
voltage (J-V) curves are compared in Fig. 1D, 
and the statistical results are shown in table 
S1 and fig. S2. When CPPA was introduced 
into the perovskite layer, the fill factor (FF) in- 
creased from 77.3% to 80.0%, and the open- 
circuit voltage (Voc) increased from 1.13 V to 
1.15 V. We attribute the improved cell perform- 
ance to enhanced intragrain charge transport 
and suppression of nonradiative charge re- 
combination by CPPA molecules at the GBs 
(45). Meanwhile, when PPO-TEMPO doping 
was applied, the Voc was further increased to 
1.18 V, which we attribute to a more favorable 
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band alignment. For the cell using CsFAMA- 
CPPA as the absorbing layer and (Spiro-Li)- 
PPO-TEMPO as the HTL, we achieved a PCE of 
23.5% with negligible hysteresis (fig. $3). We 
confirmed the measured short-circuit density 
(Jsc) of 24.8 mA cm™ by recording incident 
photon-to-current efficiency (IPCE) spectra, 
which yield an integrated current density of 
24.7 mA cm”. This value is in good agreement 
with the measured Jgc value and indicated 
that there was negligible spectral mismatch 
between our solar simulator and the standard 
global AM 1.5 solar emission (fig. S4). 

To highlight the advantages of using the 
VASP method to deposit large-area perov- 
skite films, we fabricated PSMs consisting 
of six subcells connected in series and ob- 
tained a PCE of 21.4% with an aperture area 
of 17.1 cm? (the stabilized PCE reached 21.1%) 
(Fig. 1E). We also sent one of the minimodules 
to a PV measurement laboratory, and the vali- 
dated PCE reached 19.6% (fig. S5). A photo- 
graph of the PSM is presented in Fig. 1E. The 
short-circuit photocurrent (/sc) of the mod- 
ule reached 66.3 mA, corresponding to a Jgc 
of 23.3 mA cm ® for the subcells, which was 
94% of the Jsc obtained with lab-scale cells. 
As confirmed by scanning electron micros- 
copy (SEM), the perovskite film deposited by 
the VASP method showed uniform grain size 
and height at different locations over a 10-cm 
scale (fig. S6). 


Grain modulation of the perovskite layer 


Density functional theory calculations revealed 
that CPPA associated with lead halide perov- 
skites through hydrogen bonding as well as 
coordinative bonding and fullerene-iodide 
interactions (see Fig. 2A, fig. S7, table S2, and 
supplementary note 1 in the supplementary 
text for details). Accordingly, we postulated that 
CPPA associated with GBs, thereby enhancing 
the stability of the perovskite film (26, 27). To 
prove this, we used cryo-transmission elec- 
tron microscopy (which minimizes the electron 
beam damage to the perovskites) to analyze the 
microstructures of aged perovskite films (46). 
The pristine and CPPA-modulated perovskite 
films were exposed to continuous illumination 
at 85°C. Representative high-resolution trans- 
mission electron microscopy (HRTEM) images 
of selected areas are shown in Fig. 2B and fig. 
S8. For the aged CsFAMA film, three distinct 
crystal structures were observed, and the in- 
terplanar spacings (ISs) and fast Fourier trans- 
form patterns (fig. S9) of a selected area were 
analyzed. The crystalline region with an IS of 
3.16 A was identified as the (200) domain of 
a-FAPbI;, and the region with an IS of 6.90 A 
corresponded to the hexagonal PbI, domain 
(47). For the aged CsFAMA-CPPA film, crystal- 
line regions corresponded to (220) and (210) 
domains of o-FAPbI3, and no PbI, domain was 
observed (fig. S10). Between the crystalline re- 
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gions there was an amorphous region, which 
is considered beneficial for stress relaxation 
within crystalline perovskite films. 

We analyzed the stabilization effect of CPPA 
on mixed-halide perovskites by measuring the 
photoluminescence (PL) properties of the CPPA- 
modulated perovskite films under blue-light 
irradiation. A typical mixed-halide perovskite 
composition of MAPb(Io.;Bro.5)3 was used as 
a model. When the films were exposed to 
high-intensity (13.0 W cm”) blue-light ir- 
radiation, significant phase evolution was 
observed (Fig. 2C). Both MAPDb(Ip;Bro.5)3 and 
MAPD(Io.5Bro.5)3-CPPA showed PL peaks of an 
iodine-rich (I-rich) phase at ~747 nm after 
1 min of irradiation, resulting from the regular 
photoinduced phase segregation (48-57). After 
10 min of irradiation, a phase decomposition 
for the pristine MAPb(Ip;Bro.5)3 emerged that 
manifested itself by a weak PL peak at 537 nm, 
corresponding to the characteristic emission 
peak of MAPbBr; (52-54). This indicated that 
the I-rich phase decomposed under the intense 
light illumination, leaving MAPbBr; behind, 
which appeared to be less susceptible than 
MAPDbI; to decomposition under the intense 
light beam. After 15 min, the intensity of the 
537-nm peak became comparable to that of 
the I-rich phase with a peak at 737 nm. This 
indicated that MAPbBrs has become the domi- 
nant phase in the mixed-halide perovskite 
film. The decomposition of MAPbI; may be 
caused by either photolysis or thermolysis, ac- 
cording to the equation that follows (see also 
supplementary note 2) 


MAPbI, = I, + Pb + MAI (1) 


In contrast, the CPPA-treated MAPb(Ip;Bro5)3 
resisted degradation under prolonged exposure 
to the intense blue light. Notably, the 747-nm 
PL peak remained unchanged within 15 min 
of irradiation. The stable PL emission indi- 
cated that the presence of CPPA prevented the 
decomposition of MAPbI, under high-intensity 
irradiation. The PL imaging results as a func- 
tion of irradiation time are presented in Fig. 2D 
and fig. S11. For MAPb();Bro5)3, it was found 
that green spots started to emerge at the illu- 
mination area after 10 min and became more 
intense after 15 min of irradiation. By contrast, 
the irradiated area of CPPA-treated MAPb 
([o.5Bro.5)3 Maintained its red emission during 
the whole irradiation process. This implied 
that CPPA could effectively enhance the photo- 
stability of mixed-halide perovskites (see sup- 
plementary note 2 for details). 

The stabilization of MAPbI3-rich grains 
formed during the phase segregation by CPPA 
clearly arises from its prevention of their photo- 
or thermally induced decomposition under 
blue-light irradiation. Our detailed analysis 
shows that MAPbI; decomposes into iodine, 
methylammonium iodide (MAI), and elemen- 


tal lead. This process is triggered by the loss 
of iodine from the lattice, leaving MAI and 
Pb behind. The prevention of decomposition 
by CPPA indicates that its presence at the 
perovskite GBs impairs the escape of iodine. 
Such a stabilization effect may also contribute 
to the HTL, because the diffusion of iodide 
ions from the perovskite into the HTL can re- 
duce the oxidized HTMs and cause dedoping 
issues (55). 

We found that CPPA can also improve the 
hydrophobicity and moisture stability of the 
modulated perovskite film. We exposed the pe- 
rovskite to ambient atmosphere with a high 
relative humidity of 70% and found that pris- 
tine CsFAMA rapidly degraded to 5-FAPbI, 
and PbI,, whereas CSFAMA-CPPA maintained 
its original crystal structure (Fig. 2, E and F, 
and fig. S16). We performed conductive atomic 
force microscopy (c-AFM) measurements to 
investigate the charge transport properties of 
the perovskite films (fig. S17). Compared with 
pristine CSFAMA, CsFAMA-CPPA presented 
higher current flow through the film. In par- 
ticular, the brighter contrast at GBs indicates a 
suppressed charge-transporting barrier across 
the GBs of CsFAMA-CPPA. Absolute spectral 
photon flux ® was measured to derive the PL 
quantum yield (PLQY) of the perovskite films 
(Fig. 2G), which increased from 3.27% to 6.41% 
after doping the perovskite with CPPA. The 
quasi-Fermi level splitting AE;/q (where Ey is 
the Fermi level and q is the elementary charge) 
increased from 1.176 to 1.192 V for CSFAMA 
versus CSFAMA-CPPA (table 83) (56, 57) and 
reflected the suppression of nonradiative re- 
combination with the CPPA additive. 


Radical polymeric p-doping of the HTL 


Effective and homogeneous doping and stabi- 
lization of the HTM is critically important for 
high-performance PSMs. In addition, it is nec- 
essary to develop effective HTM doping meth- 
ods to facilitate the fabrication of PSMs. We 
oxidized the neutral radical polymer PPO- 
TEMPO to a poly(oxoammonium salt) of PPO- 
TEMPOC(TFSI) (see the materials and methods 
section of the supplementary materials for 
synthesis details) and incorporated it as a 
p-dopant of spiro-MeOTAD. PPO-TEMPO/ 
PPO-TEMPO(TFSI) had a redox potential of 
0.925 V compared with a normal hydrogen 
electrode (NHE), as determined by electro- 
chemical measurements (fig. S18), which is 
greater than that of spiro-MeOTAD/spiro- 
MeOTAD:-* (0.72 V compared with NHE) and 
provides a driving force of ~200 mV to oxidize 
spiro-MeOTAD through a one-electron trans- 
fer reaction (58). Upon the reduction of the 
oxoammonium, this cationic pendant group 
formed the stable radical polymer poly(4- 
glycidyloxy-2,2,6,6-tetramethylpiperidine- 
1-oxyl) (PPO-TEMPO), which can interact with 
Li* ions and stabilize the HTL (Fig. 3A) (59). 
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Fig. 2. Working mechanism of CPPA and luminescence enhancement in perov- 


skite films. (A) The scheme of interaction 
(B) Cryo-HRTEM images of perovskite film 


modes between CPPA and perovskites. 
S aged under continuous illumination at a 


temperature of 85°C for 7 days in nitrogen atmosphere. The magnifications of the 
boxed areas are shown in fig. S7. (€) The evolution of PL spectra of MAPb(Io5Bros)3 


and MAPb(lo.sBros)3-CPPA films under co! 


ntinuous blue-light irradiation (mercury 


lamp; wavelength: 460 to 490 nm; intensity: 13.0 W cm’). (D) PL imaging results 
of MAPb(losBros)3 and MAPb(Io5Bros)3-CPPA films after 15 min of blue-light 
irradiation. (E) X-ray diffraction (XRD) patterns and (F) UV-vis spectra of CsFAMA 
and CsFAMA-CPPA perovskite films aged in a humid environment (relative humidity: 
70 to ~75%) for 0 to ~168 hours. (G) Photoluminescence spectra photon flux ® 
of the perovskite films. a.u., arbitrary units. 


The changes in the UV-visible (UV-vis) ab- 
sorption spectrum of a spiro-MeOTAD solu- 
tion in chlorobenzene upon PPO-TEMPO doping 
are shown in Fig. 3B. Increasing the amount 
of dopant gave rise to an absorption peak at 
~520 nm from the monocation radical of spiro- 
MeOTAD>" that increased linearly with dopant 
concentration (30). We estimated the doping 
efficacy, defined by the molar ratio of the oxi- 
dized spiro-MeOTAD to the added dopant, to 
be ~ 90%. Although spiro-MeOTAD was oxi- 
dized to the cation radical spiro-MeOTAD-+, 
PPO-TEMPO(TFSI) was reduced to its neutral 
polymeric radical state in the doped HTMs. 
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The doping effect was further investigated 
by conductivity and UV photoelectron spectros- 
copy (UPS) measurements. The conductivity of 
the Spiro-Li film was evaluated by measuring 
the J-V curves of a symmetrical device with 
Au/Spiro-Li/Au architecture. As shown in Fig. 
3C, the undoped Spiro-Li film kept under Ny 
showed quite a low electrical conductivity of 
1.19 x 10-7 S cm“, but the Spiro-Li film with 
PPO-TEMPO doping had a high electrical con- 
ductivity of 3.32 x 10-° S cm without expo- 
sure to oxygen (see supplementary note 3 for 
details). UPS results confirmed that PPO- 
TEMPO doping improved the energy band 


alignment in the perovskite/HTL/Au stack (see 
figs. S19 and S20 and supplementary note 4 for 
details). The deepened Fermi level and increased 
conductivity confirmed efficient p-type doping 
with PPO-TEMPO/PPO-TEMPOCTFSI). 
Compared with the gas-doping strategies 
with O, or COs, which usually involve photo- 
induced gas-liquid or gas-solid reactions (34), 
PPO-TEMPO doping in HTM solution was 
more efficient and would largely simplify the 
doping process. The Li*/PPO-TEMPO radical 
interaction (59) can also prevent the migration 
of lithium cations under heat and voltage 
stress. To evaluate this effect, the Spiro-Li 
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Fig. 3. Doping effect and stabilized bulk morphology. (A) Schematic 
illustration of the doping mechanism of PPO-TEMPO and PPO-TEMPO(TFSI) 

of organic HTMs, and the interaction mode between the reduction product 
PPO-TEMPO and LiTFSI additive. (B) The absorption spectra of Spiro-Li without 
doping and Spiro-Li with PPO-TEMPO doping of different ratios. (©) The J-V 
curves measured with symmetrical device of Au/Spiro-Li/Au for evaluating the 


films with and without PPO-TEMPO doping 
were aged under heat at 75°C in dry air. After 
100 hours, voids, microdefects, and particles 
emerge on the film without PPO-TEMPO, 
whereas the film with PPO-TEMPO doping 
maintained its initial pinhole-free surface 
morphology (Fig. 3, D and E). The voids repre- 
sent inhomogeneities within the HTL. 

We correlated these morphological deforma- 
tions with inhomogeneous distribution of Li* 
by analyzing the films with the EELS (electron 
energy loss spectroscopy) mapping technique 
(Fig. 3, F and G; see supplementary note 5 for 
details). In the HRTEM morphology images 
(fig. S21), several pits were observed, which 
was consistent with those in SEM images. For 
EELS mapping images, the relative color shift 
depicts the fluctuation in local Li concentra- 


292 20 JANUARY 2023 * VOL 379 ISSUE 6629 


1000 


PPO-TEMPO doping. 


tion. In the control sample, the higher-contrast 
white spots highlighted by yellow cycles indi- 
cate a high Li content. These Li-rich domains 
of ~50-nm size were associated with the pits 
shown in the HRTEM morphology image. Thus, 
our direct mapping of Li in a control HTL film 
revealed nanoscale inhomogeneities (nonstoi- 
chiometric regions) associated with local phase 
separation. It appeared that PPO-TEMPO dop- 
ing could improve the PSC stability by pre- 
venting such local Li* accumulation. 

In addition, this polymeric p-doping strat- 
egy also provides an effective solution of HTM 
doping for the fabrication of PSMs. As shown 
in fig. S22, PSM using the regular Spiro-Li HTL 
required an ambient air exposure process of 
more than 24 hours. Such a process will un- 
duly slow down the fabrication of PSMs. By 
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conductivity of the HTL with and without PPO-TEMPO doping in air and nitrogen 
(the inset shows the extracted conductivity values). The aged samples were 
heated at 75°C for 30 min. (D and E) Surface SEM images of Spiro-Li films 
with and without PPO-TEMPO doping after aging at 75°C for 100 hours. (F and 
G) Li element EELS mapping images of aged Spiro-Li films with and without 


contrast, the PSM with PPO-TEMPO doping 
exhibited its optimal photovoltaic perfor- 
mance with modular PCE > 20% right after 
fabrication. Our approach has provided effec- 
tive HTM doping of the PSM and will facilitate 
future mass production of PSMs. 

We investigated the operational stability of 
as-fabricated cells by conducting maximum 
power point (MPP) tracking under continuous 
illumination. The target cells treated with 
CPPA and PPO-TEMPO doping maintained 
95.5% of their initial PCE after 3265 hours 
under continuous illumination at 70° + 5°C, 
whereas the control cells without PPO-TEMPO 
doping lost >30% of their initial PCEs in just 
1000 hours (Fig. 4A). The void-like structures 
and inhomogeneities within HTLs likely im- 
peded the top-down charge transport during 
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Fig. 4. Device stability and degradation analysis. (A) Evolution of the PCE over time, as measured by MPP tracking of encapsulated cells under light soaking 
with AM1.5G simulated illumination at a temperature of 70° + 5°C. (B) TOF-SIMS depth profiles of the iodine and Ag in SnO2/perovskite/HTL/Ag stacked layers. 
PO doping. 


(C) Thermal stability test performed at 85°C for the cells using Spiro-Li as the HTL with and without PPO-TE 


operation of PSCs and would lead to severe 
degradation of the devices by triggering the 
reactions between the perovskite and metal 
back contacts (41, 43). TOF-SIMS results proved 
that PPO-TEMPO doping can effectively sup- 
press the intrusion of iodide into the HTL (Fig. 
4B) and prevent the reaction between the pe- 
rovskite layer and metal electrode. 

We also attributed the enhanced stability to 
the suppressed Li* diffusion in the HTL (60) 
and conducted TOF-SIMS depth profiling of Li* 
along the vertical direction through the stacks 
(fig. S23, and see supplementary note 6 for 
details). When PSCs were aged with a Spiro-Li 
layer, Li* was found at a higher concentration 
at the perovskite/SnO, interface. Li* intrusion 
into the perovskite active layer could also re- 
sult in decomposition of the perovskite, leading 
to cell performance degradation (67). However, 
only a small amount of Li* diffused out from 
the aged PPO-TEMPO-doped HTL, which we 
attributed to the formation of Lewis acid-base 
interaction between Li* and the ether linkages 
in the PPO-TEMPO polymer backbone (62). 

To investigate the stability of the cells under 
thermal stress, we tested PSCs under prolonged 
heat exposure at 85°C according to ISOS (Inter- 
national Summit on Organic Photovoltaic Stab- 
ility) procedures (63). Given the glass-transition 
issues of Spiro-MeOTAD, the cells suffered a 
certain extent of initial losses (Fig. 4C). For the 
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cells using Spiro-Li as the HTL, power output 
rapidly decayed by >40% after 200 hours, and 
by >70% after 1000 hours, of storage at 85°C. 
Conversely, after doping Spiro-Li with PPO- 
TEMPO, the cell power output decayed only 
by <20% after 1000 hours of storage at 85°C. 

To show the universality of this polymeric 
redox couple doping strategy, we used PPO- 
TEMPO doping for two other representative 
polymeric hole conductors, PTAA and poly(3- 
hexylthiophene) (P3HT). Both of the doped 
HTLs exhibited higher conductivities than did 
pristine films (fig. S24), realizing more-efficient 
PSCs without any postfabrication treatments 
(see table S4, fig. S25, and supplementary 
note 7 for details). We infer that the doping and 
stabilizing effect of PPO-TEMPO doping on or- 
ganic HTMs could be broadly applicable. The 
highly doped HTLs exhibited better conduc- 
tivity, so the thickness of the HTL in PSCs can 
be increased to >150 nm. Integrated with the 
CPPA-enhanced perovskite layer, the holistic 
strategy for stabilizing the perovskite/HTL 
heterostructure provides promising technical 
routes for fabricating efficient and stable PSMs. 
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Breaking the limitation of polarization multiplexing 
in optical metasurfaces with engineered noise 


Bo Xiong't, Yu Liu’t, Yihao Xu7t, Lin Deng®, Chao-Wei Chen’, Jia-Nan Wang’, Ruwen Peng**, 


Yun Lai’, Yongmin Liu?**, Mu Wang>4* 


Noise is usually undesired yet inevitable in science and engineering. However, by introducing the 
engineered noise to the precise solution of Jones matrix elements, we break the fundamental limit of 
polarization multiplexing capacity of metasurfaces that roots from the dimension constraints of the 
Jones matrix. We experimentally demonstrate up to 11 independent holographic images using a 

single metasurface illuminated by visible light with different polarizations. To the best of our knowledge, 
it is the highest capacity reported for polarization multiplexing. Combining the position multiplexing 
scheme, the metasurface can generate 36 distinct images, forming a holographic keyboard 

pattern. This discovery implies a new paradigm for high-capacity optical display, information encryption, 


and data storage. 


etasurfaces (J-3) provide rich degrees 

of freedom (DOFs) to modulate the 

phase (4, 5), amplitude (6, 7), and po- 

larization (8, 9) of light based on sub- 

wavelength structures. Compared with 
traditional optical devices, metasurfaces possess 
distinct advantages of the ultrathin thickness 
(0, 11), high efficiency (12-14), and broad- 
band responses (15-17). In recent years, metasur- 
faces have been demonstrated for wavelength 
multiplexing (8, 19), angle multiplexing (20), 
and orbital angular momentum (OAM) multi- 
plexing (27), as well as reconfigurable meta- 
devices (22, 23). 

Polarization (24-27) has been explored in 
metasurfaces for large-capacity multiplexing 
technology, which can transmit information 
through independent channels to different 
desired targets. However, there exists an up- 
per limit in the capacity of polarization multi- 
plexing. Traditionally, two orthogonal linear 
polarizations (e.g., x- and y-polarized compo- 
nents) operate dual independent functional- 
ities without cross-talk in a two-dimensional 
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(2D) planar optical platform such as a meta- 
surface (28, 29). This approach relies on the 
diagonal elements of the Jones matrix, which 
is a 2 x 2 matrix representing the optical re- 
sponse of a metasurface. Amplitude modula- 
tion has also been introduced in polarization 
multiplexing. Applying Malus’s law has led to 
the design of metadevices to encode nanoprint- 
ing and holography simultaneously in the 
near- and far-field, respectively (30, 31). More 
recently, the off-diagonal elements of the Jones 
matrix were used as an extra degree of free- 
dom, realizing three independent polariza- 
tion channels (32) and achieving triple sets 
of printing-hologram integrations (33). This 
approach reaches the theoretical upper limit 
(<3 independent channels) of polarization 
multiplexing capacity in 2D planar metasur- 
faces. This restriction applies not only to linear 
polarizations, but also to circular polarizations 
[left circular polarization (LCP) and right cir- 
cular polarization (RCP)] (34, 35), and even 
arbitrary polarizations (36). Although polar- 
ization multiplexing has been substantially 
advanced through physics-guided forward 
design (37, 38) and data-driven inverse design 
based on machine learning (39, 40), the fact 
that the number of independent channels is 
limited to three is a notable constraint in prac- 
tical applications. Breaking this upper limit will 
be crucial for developing high-capacity optical 
displays and data storage. 

We develop an approach based on noise en- 
gineering that breaks this fundamental limit. 
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By introducing the correlated noise with least- 
squares estimation, new polarization channels 
can be created with only moderate cross-talk. 
To reduce the cross-talk, we further introduce 
the noncorrelated noise with a random dis- 
tribution to push the limit of polarization 
multiplexing. Illuminated by the incidence 
with different linear polarization states, the 
metasurface can generate up to 11 indepen- 
dent holographic images at the corresponding 
channels. Simulation shows that the capac- 
ity could be expanded further by combining 
our methodology with position multiplexing 
or OAM multiplexing. As a proof of concept, 
we demonstrate 36 distinct holographic im- 
ages, forming a typical keyboard pattern by 
simultaneously using polarization multiplex- 
ing and position multiplexing. 


Design strategy for engineered noise 


Our design strategy is based on approximate 
solutions of the Jones matrix elements with 
sufficient engineered noise, allowing us to 
overcome the constraint in the polarization 
multiplexing capacity of metasurfaces (Fig. 1). 
The unit cell of the metasurface (Fig. 1A) work- 
ing in the reflective configuration consists of 
multiple rectangular resonators with distinct 
lengths (Z), widths (W), and rotation angles (), 
each of which supports two eigen dipoles along 
the short and long axes (see details in supple- 
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Fig. 1. Schematic of the design of polarization-multiplexed metasurface. 
(A) Designed unit cell and the corresponding response matrix. (B) In step 1, 
the least-squares solutions (solid black arrows) with the correlated noise 
(dashed short arrows) are introduced to create new polarization channels, 
where the target signals (dashed long arrows) are entirely distinct from each 
other. However, as shown in the inset, the solved complex outputs still have 


mentary note 1). The Jones matrix of the unit 
ae | , where Ey, Eny, 
and £, are complex numbers and represent 
the effective Jones matrix elements. They are de- 
termined by the parameter group {L;, Wi, 0;}, 
where 7 indicates the serial number of rectan- 
gular resonators. In the presence of / incident 
light beams with different linear polarizations, 
the corresponding optical responses of the 
unit cell can be generalized to Eq. 1 (details in 


cell is given by | 


supplementary note 2) 
O; cos? 0, sin20, sin? 6, 
Oz cos? 8) sin20, sin” 05 Ex 
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Here Oy denotes the complex reflected field 
at the M-th linear polarization channel along 
the polarization angle 9y (with respect to 
the w axis). Equation 1 is a collection of non- 
homogeneous linear equations that can be 
written as O = A- E. According to linear al- 
gebra (41), Eq. 1 has a precise solution for the 
required Jones matrix elements only if M<3. 
Therefore, the independent channels for po- 
larization multiplexing have an upper limit of 
3, consistent with the conclusion of several 


some correlations, resulting in the moderate cross-talk of holographic images. 
(C) In step 2, the random noncorrelated noise (red shadow circles) is introduced, 
which breaks the correlation between different channels and eliminates the 
cross-talk. (D) Schematic of the polarization-multiplexed metasurface. By 
changing the polarization direction of the incident light, the metasurface can 
generate multiple independent holographic images. 


previous works (33-36) based on the analysis of 
DOFs. When M = 4, the fourth channel is the 
superposition of the first three polarization 
channels (see details in supplementary note 3). 

We deliberately introduce noise into our 
system to break the polarization multiplex- 
ing limit of metasurfaces, offering new DOFs 
in the design space. In the first step, the intro- 
duction of correlated noise creates additional 
polarization channels by considering the least- 
squares solution (47) instead of the precise 
solution of Eq. 1. By solving such overdeter- 
mined linear equations, the correlated noise is 
automatically introduced. The new solution 
can be written as Z = (A™A) ‘ATO (see details 
in supplementary note 4), and thus the com- 
plex outputs O = A-E (e.g., Oi, Oo, O3, O4 
when M = 4) can be expressed by the solid 
black arrows in Fig. 1B. The dashed long ar- 
rows represent the target signals at the cor- 
responding polarization channels, whereas 
the dashed short ones represent the corre- 
lated noise (defined as AO, = |A - E — O)). It 
follows that the target signal of the fourth 
channel is entirely distinct from the other 
three, indicating the generation of new func- 
tionality. However, we emphasize that the 
correlated noise can be theoretically decom- 
posed into the perturbations from the other 
channels (see details in supplementary note 4). 
The outputs at different polarization channels 
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Fig. 2. Design and experimental demonstration of the five-channel polarization- 
multiplexed metasurface. (A) Calculated correlated noise distribution 

based on Eq. 1, which has a mean value of ~0.22n and a standard deviation of 
~0.21n. (B) Added noncorrelated noise distribution with a mean value of 0.397 
and a standard deviation of 0.287. (C) Optical and SEM micrographs of the 
fabricated sample with different magnifications. The scale bars from left to 
right are 40 um, 40 um, 0.5 wm, and 250 nm, respectively. (D) Designed, 

(E) simulated, and (F) measured holographic images with different polarization 
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incidences, which are marked by the purple arrows. All sca 
When the incident light is polarized along 0°, 36°, 72°, 108°, 


e bars are 40 um. 
and 144° the 


metasurface will generate five independent holographic images, including 


“NTS” “&" “E" and “U" (G) Calculated correlation coeffici 
and (H) energy distribution matrix based on the simulation 


ents matrix 
results in (E). 


(I) Measured correlation coefficients matrix and (J) energy distribution 
matrix based on the experimental results in (F), which agree well with 
the simulations. 
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are linearly correlated, resulting in the moder- 
ate cross-talk of holographic images between 
different polarization channels (Fig. 1B, insets). 

To eliminate such cross-talk, we proceed to 
the second step and add the noncorrelated 
noise (details shown in supplementary note 5), 
represented by the red shadow circles in Fig. 
1C. In this case, the new solution is expressed 
as E, =(A™A) ‘470 + 8E, where each com- 
ponent of 5# is randomly sampled from a 2D 
complex normal distribution r x N(0, J). Here 
r indicates the amplitude of the random noise, 
and J is the identity matrix. Then the non- 
correlated noise can be written as AO, = |A*dE]. 
Such noise is similar to the additive Gaussian 
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noise in information theory (42), which mimics 
the effects of many random processes occur- 
ring in nature, including black body radiation 
from Earth and other warm objects, thermal 
vibrations of atoms in conductors (referred to 
as thermal noise), and random manufacturing 
error in experiments. According to the central 
limit theorem (43), the probability distribution 
of the mean noise produced by vast random 
events actually approaches a Gaussian dis- 
tribution. They are typically undesirable but 
ultimately are beneficial in our design. As 
illustrated in fig. S5, when the value of 7 in- 
creases from 0 to 1, the correlation between 
the neighboring channels is gradually broken 
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down by the noncorrelated noise, although the 
intensity in the target channels becomes lower. 
Consequently, the cross-talk between different 
polarization channels diminishes by selecting 
an appropriate value of7, realizing polarization 
multiplexing with a single metasurface (Fig. 
1D). This result highlights that correlated and 
noncorrelated noise have distinct physical 
origins and implications. The correlated noise 
can be passively introduced by solving the over- 
determined linear equations, which creates 
additional information channels with cer- 
tain cross-talk. The intentionally introduced 
noncorrelated noise with a Gaussian noise 
formula substantially reduces the cross-talk, 
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Fig. 3. Simulation and experimental demonstration of the new upper limit. 

(A) Phase diagram for polarization multiplexing in our design. Phase | represents 
the conditions that satisfy the threshold requirements. Phase II represents 

the cases in which the noncorrelated noise is so small that the nondiagonal term 
is more significant than 0.2, indicating that the cross-talk is rather substantial. 

Phase Ill represents the cases in which the noncorrelated noise is so large 

that the diagonal term is larger than 0.8, indicating that the energy in the design 
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channel is rather weak. Phase IV indicates the scenarios where all the threshold 
conditions are not satisfied. The inset in the right panel shows the corresponding 
correlation coefficient matrices for different channel numbers from M = 3 to 11, 
all of which satisfy the threshold requirements. (B) Optical and (©) SEM 
micrograph of the fabricated sample for M = 11 The scale bars are 50 um 

and 0.4 um, respectively. (D) Measured holographic images at 11 linear polarization 
states. The arrows denote the polarization orientations. 
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despite sacrificing the signal-to-noise ratio 
in the primary channels. 


Testing the design strategy 


We first take M = 5 to test our design. Figure 2A 
shows the calculated histogram phase distrib- 
ution of AO, with a mean value of ~0.22n and 
a standard deviation of ~0.21n. Such a noise 
value is relatively low in the metasurface de- 
sign, close to the noise amplitude in four-level 
phase discretization (~0.25z). It follows that 
five holographic images can be generated at 
five polarization states (fig. S6). However, the 
created holographic images still contain some 
cross-talk from the other channels. To suppress 
the cross-talk, we introduce an appropriate 
noncorrelated noise (7 = 0.3) with a mean 
value of ~0.39x and a standard deviation of 
~0.28x (Fig. 2B), which is similar to the case 
of binary phase design. These results dem- 
onstrate that the engineered noise is within 
the margin of the design error. Then we design 
a device with multiplexed holographic images 
(Fig. 2D), following the iterative Fourier trans- 
form algorithm (44) and genetic algorithm (45). 
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Fig. 4. Experimental demonstration of the holographic keyboard. (A) Optical 
and (B) SEM micrograph of the fabricated sample with simultaneously 
polarization- and position-multiplexed design. The scale bars are 50 um and 

0.4 um, respectively. (©) Schematic and (D) measured results for the performance 
of the holographic keyboard pattern. Here we choose nine different polarization 
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The final approximate solutions with engineered 
correlated and noncorrelated noise are fed 
into the genetic algorithm as complex numbers, 
which can search the optimized geometrical 
parameters for each pixel with the smallest 
mean square error. The genetic algorithm is 
critical to the design of the metasurface. By 
mimicking the natural processes of selection, 
reproduction, mutation, and crossover, the ge- 
netic algorithm is a metaheuristic algorithm 
that can solve global optimization problems 
and effectively explore the parameter space (de- 
tails shown in supplementary note 6). Figure 2E 
presents the simulation result in which five 
independent holographic images can be gener- 
ated from the same metasurface when the polar- 
ization of incident light varies. Figure 2C shows 
the optical and scanning electron microscopy 
(SEM) micrographs of the sample (220 um by 
220 um in size) fabricated by standard electron 
beam lithography. The working wavelength of 
the metasurface is 700 nm. From the measure- 
ment results presented in Fig. 2F, we can ob- 
serve that the same metasurface generates 
five independent holographic images—“N”, 
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“J”, “&”, “E”, and “U”—at the corresponding 
polarization states (0°, 36°, 72°, 108°, 144°) 
(details shown in supplementary note 7). 
To quantify the cross-talk between the images 
at different channels, we introduce the corre- 
lation coefficient, which is defined as (46, 47) 


p(T,R) = _COV(T, R) _ (2) 
D(T) - D(R) 

Here 7 and R represent the intensity distribu- 
tion of the target object and reconstructed ob- 
ject, COV(T, R) is the covariance of T and R, 
and D(T) and D(R) represent the variances of 
T and R, respectively. According to this defini- 
tion, if we consider M polarization channels, 
we can calculate the correlation matrix with a 
size of M x M, in which the diagonal terms 
represent the target signal, whereas the non- 
diagonal values indicate the cross-talk. The 
simulated correlation coefficients of the non- 
diagonal terms are all below 0.3 (Fig. 2G), con- 
firming that these polarization channels are 
relatively uncorrelated and hence independent. 
Moreover, up to 60% of the power is directed 
to the targeted channel (Fig. 2H). The measured 
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channels covering from 0° to 160° Four holographic images are simultaneously 
generated at different planes away from the metasurface for each polarization 
channel. The locations of the planes are z, = 330 um, Z2 = 430 um, z3 = 570 um, 
and z, = 830 um. As a result, 36 holographic images can be generated to 
form a typical keyboard pattern. 
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correlation matrix and energy distribution matrix 
in Fig. 2, I and J, also agree well with the sim- 
ulations. Here the fabrication error acts as an 
additional noncorrelated noise component that 
further reduces the cross-talk between the chan- 
nels. These results demonstrate that by com- 
bining AO, and AO,, we can successfully break 
the constraint in the polarization multiplex- 
ing capacity of metasurfaces and realize the 
polarization-multiplexed metasurface with an 
independent channel number larger than 3. 


The upper limit for channel number 


We calculate the phase diagram for polariza- 
tion multiplexing (Fig. 3). The correlated noise 
distributions from M = 2 to M = 12 are first 
solved (fig. S9). The mean value of AO, increases 
slowly from 0 to ~ 0.33 as the channel num- 
ber increases. This is because the least-squares 
solution requires more perturbations to fulfill 
Eq. 1 when M becomes larger. We have also 
swept the value of 7 from 0 to 1, and the cal- 
culated correlation matrices for each scenario 
are shown in fig. S11. It is noteworthy that 
because of the random nature of AO, the cor- 
relation matrices vary from time to time, even 
when the same level of AO,, is applied. To pre- 
vent fluctuation in the results, we take the sta- 
tistical average of 10 correlation matrices. 
Lastly, to evaluate the multiplexing capacity, 
we follow a threshold value of 0.8 (47) for the 
correlation coefficient, resulting in the phase 
diagram shown in Fig. 3A. The examples that 
meet the threshold requirements (diagonal 
terms larger than 0.8 and nondiagonal terms 
less than 0.2 in this study) are marked in red 
(Phase I). By contrast, the rest (Phase II, Phase 
III, and Phase IV) cannot meet the require- 
ments. When the noncorrelated noise is too 
small, the green region (Phase IJ, nondiagonal 
terms larger than 0.2) shows that the cross- 
talk is too strong. When the noncorrelated 
noise is too strong, the blue area (Phase III, 
diagonal terms less than 0.8) indicates that 
the patterns in the targeted channels have low 
intensity. We can see that the new upper limit 
is ~11 with r = 0.5. The upper limit might be 
increased further by relieving the threshold 
conditions (fig. $12). Figure 3, B and C, shows 
the optical and SEM micrographs of the sam- 
ple for M = 11. Figure 3D presents the multi- 
plexed holographic images measured in our 
experiment. By changing the polarization 
angles of incident light from 0° to 163.63° in 
the step of 16.36° each, 11 different holographic 
images are generated from the same meta- 
surface. Despite some background noise, we 
can recognize all the designed Greek letters, 
including a, B, y, 5, p, 9, A, w, a, €, and o, 
demonstrating excellent performance of po- 
larization multiplexing design in the new up- 
per limit. The experimental results agree very 
well with the simulations shown in fig. $13. In 
addition to holograms, we expect that our de- 
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sign methodology can be applied to other meta- 
devices such as metalenses. The key is to choose 
the appropriate parameters as the figure of 
merit and find the phase diagram for the spe- 
cific applications (details shown in supplemen- 
tary note 8). 

Furthermore, our polarization multiplex- 
ing strategy based on noise engineering can 
be combined with other multiplexing meth- 
ods, such as position multiplexing (fig. S14) 
and OAM multiplexing (fig. S15), to boost 
the information-encoding capacity. As in the 
last example, nine linear polarization channels 
(0°, 20°, 40°, 60°, 80°, 100°, 120°, 140°, 160°) 
are chosen, and each channel contains four 
different holographic images at the planes 
330, 430, 570, and 830 um away from the 
metasurface (Fig. 4). The device is 330 um by 
330 um in size. The optical and SEM micro- 
graphs of the metasurface are shown in Fig. 4, 
A and B. The target holographic images for 
the 36 channels at the wavelength of 700 nm 
are 26 letters and 10 Arabic digits, resulting 
in a typical keyboard pattern, as seen in the 
dashed region of Fig. 4C. Figure 4D depicts the 
measured results from the constructed meta- 
surface, which create a total of 36 separate holo- 
graphic images and match the design well. To 
the best of our knowledge, this is the largest 
capacity reported in metasurfaces with polar- 
ization and position multiplexing design, and 
it can be further improved by OAM multiplex- 
ing, wavelength multiplexing, angle multiplexing, 
and other techniques. The design combining 
noise engineering and OAM multiplexing is 
presented in supplementary note 9. 


Concluding remarks 


To summarize, we break the polarization multi- 
plexing limit of metasurfaces with engineered 
noise in the visible region. It is expected that 
the capacity of traditional optical communica- 
tion systems (e.g., wavelength multiplexing) 
and information storage systems (e.g., polar- 
ization multiplexing) can be further boosted 
with more parallel channels by using our ap- 
proach. Furthermore, although we focus on 
metadevices operating in the visible frequen- 
cies, this design strategy can be readily applied 
to infrared, terahertz, or microwave frequencies. 
It opens a new pathway to enhance parallel 
processing, reduce cross-talk between chan- 
nels, and increase the storage capacity in op- 
tical communication, optical encryption, and 
information storage systems. 
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STELLAR ASTROPHYSICS 


Spin-down by dynamo action in simulated radiative 


stellar layers 


Ludovic Petitdemange’+, Florence Marcotte2*+, Christophe Gissinger?**+ 


The evolution of a star is influenced by its internal rotation dynamics through transport and mixing 
mechanisms, which are poorly understood. Magnetic fields can play a role in transporting angular 
momentum and chemical elements, but the origin of magnetism in radiative stellar layers is unclear. 
Using global numerical simulations, we identify a subcritical transition from laminar flow to turbulence 
caused by the generation of a magnetic dynamo. Our results have many properties of the theoretically 
proposed Tayler-Spruit dynamo mechanism, which strongly enhances transport of angular momentum 

in radiative zones. The dynamo generates deep toroidal fields that are screened by the stellar outer 
layers. This mechanism could produce strong magnetic fields inside radiative stars without an observable 


field on their surface. 


s young stars form through accretion, or 

after aging stars have burned through 

their hydrogen fuel, the star’s core con- 

tracts. Conservation of angular momen- 

tum causes the core to spin up (rotate at 
a faster rate), producing strong gradients in 
the angular velocity as a function of radius 
(the rotation profile), a situation known as 
differential rotation. Measurements with stel- 
lar pulsations (asteroseismology) have shown 
that stars at various stages of their evolution 
have internal rotation profiles that are flatter 
than expected from stellar evolution models, 
especially across radiative zones—the layers in 
which the outward transport of energy occurs 
through radiative diffusion, rather than convec- 
tion (/-3). This discrepancy could be resolved 


if there were an unidentified mechanism that 
extracts angular momentum from the stellar 
core and suppresses differential rotation as 
the star evolves (4). 

A potential mechanism for enhanced angu- 
lar momentum transport is stellar magnetism 
(5-7). Magnetic fields on stellar surfaces can 
collimate jets of plasma (8) or power flares (9). 
Theoretical treatments of magnetic fields in 
radiative zone models have shown that they 
modify the predicted dynamics of stars (10-14). 
However, these predictions are limited by two 
theoretical problems. First, magnetic fields are 
difficult to observe in deep stellar layers, in- 
cluding the radiative cores of stars with less 
than ~1.3 solar masses. Even in stars for which 
the radiative zone is located in the envelope 


(outside the core), in most cases the ampli- 
tude of potential magnetic fields falls below 
the detection limits of direct observations of 
stellar surfaces—except for 10% of these stars, 
for which strong dipolar magnetic fields have 
been measured (15). Secondly, the mechanism 
by which a dynamo magnetic field can be gen- 
erated inside a radiative stellar layer remains 
unclear. 


Theoretical predictions 


Dynamo instability is the spontaneous devel- 
opment of an amplification loop by which a 
magnetic field directed toward the poles (po- 
loidal field) is converted into a magnetic field 
parallel to lines of latitudes (toroidal field), and 
vice versa (16). This conversion is mediated by 
plasma motions, which must be sufficiently 
powerful for an initially weak magnetic field 
to undergo self-amplification. In convective 
stellar layers, the required flow complexity can 
be provided by turbulent buoyant plumes 
(17, 18). However, in stably stratified, radiative 


Laboratoire d'Etudes du Rayonnement et de la Matire en 
Astrophysique et Atmosphéres (LERMA), Observatoire de 
Paris, Paris Sciences & Lettres (PSL) Research University, 
French National Centre for Scientific Research (CNRS), 
Sorbonne Université, Paris, France. "Université Cate 

d'Azur, National Institute for Research in Digital Science and 
Technology (Inria), CNRS, Laboratoire J.A. Dieudonné, Nice, 
France. *Laboratoire de Physique de I'Ecole Normale 
Supérieure (ENS), Paris Sciences & Lettres (PSL) Research 
University, CNRS, Paris, France. “Institut Universitaire de 
France, Paris, France. 

*Corresponding author. Email: ludovic.petitdemange@upmc.fr 
(L.P.); florence.marcotte@inria-fr (F.M.); christophe.gissinger@ 
physensfr(C.G) ——————— 
‘FThese authors contributed equally to this work 


a 


10°£ 


Ne 
eas 


Fig. 1. Subcritical dynamo bifurcation. (A) Time series of the magnetic 
energy (quantified by the dimensionless Elsasser number A) at fixed rotation, 
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(25)], but varying magnetic Prandtl number {Pm = [0.35 (blue); 0.42 (red); 
0.5 (yellow); 1 (purple) ]}, ordered as indicated by the dashed black arrow. 
Times (t) are normalized by the Ohmic diffusion time t,, (25). A secondary 
instability occurs when the magnetic energy exceeds a typical value A. ~1 
(dashed black line). (B) Time-averaged magnetic energy density of the saturated 
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dynamo as a function of shear rate (quantified by the Rossby number Ro), 

for Pm = 1) [other parameters the same as in (A)]. Empty diamonds indicate 
linearly unstable solutions, from which the magnetic field grows exponentially. 
Solid diamonds illustrate the hysteresis cycle (black arrows) between a 
nonmagnetic solution (A = 0) and a strong, toroidal dynamo solution. The 
scenario we propose follows the red arrow: the dynamo arises for initially large 
differential rotation, suppresses the shear as the star evolves, and maintains 
the magnetic field below the stability threshold. 
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Fig. 2. Transition to tur- 


bulence. (A) Time series 
of the total kinetic and 
magnetic energy densities. 
The arrow marks the 

time where the amplitude 
of the axisymmetric 
component of the 
azimuthal magnetic field 
(B, locally exceeds 

the prediction of the Tayler 
instability, BS (21). Angle 
brackets denote spatial 


magnetic energy density [erg/cm*] 


averaging in the azimuthal OR 
direction (@ coordinate). 
(B) Radial profiles of the 
azimuthally averaged 
angular velocity (Q in the 
equatorial plane for two 
distinct times, labeled [t1] 
and [t2] in (A), with r; 
and r, as the radii of the 
inner and outer shells, 
respectively. The onset 

of the instability causes 
the rotation profile to 
flatten between [tl] and 
[t2]. Snapshots below are 
of (C and D) the non- 
axisymmetric angular 
velocity in the equatorial 
plane, (E and F) the 
azimuthal magnetic field in 
the equatorial plane and 
(G and H) the same 
quantity in the meridional 
plane. Results are 

shown for the times [t1] 
(panels C, E, and G) 
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and [t2] (panels D, F, and H), before and after the onset of secondary (Tayler) instability. 
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This simulation has parameters E = 10~°, N/Q 


layers, dynamo action (and thus magnetic 
braking) requires a different source of hydro- 
dynamic turbulence. 

Several models have been considered to pro- 
vide angular momentum transport, includ- 
ing internal waves (19) or magnetic instabilities 
(20). An example of the latter approach is the 
Tayler-Spruit (TS) dynamo model (27, 22). In 
this model, magnetic field generation in ra- 
diative layers relies on (i) the winding of a 
poloidal field into a toroidal field by differ- 
ential rotation [the Q-effect (76)] and (ii) the 
destabilization of the resulting strong, toroi- 
dal, and axisymmetric magnetic field by the 
Tayler instability (23), which regenerates a 
poloidal field and thus (in theory) closes the 
dynamo loop initiated by differential rota- 
tion. However, global numerical simulations 
have not produced TS dynamos, casting doubt 
upon whether the simplifications made in 
the theory are valid when the plasma is tur- 
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1.24, Pr 


0.1, Ro = 0.78, and Pm = 1. 


bulent (24). It is therefore unclear whether a 
dynamo mechanism could operate in a stably 
stratified stellar layer. We sought to numer- 
ically investigate whether a magnetic field 
can build up through dynamo instability, trig- 
ger magnetohydrodynamic turbulence, and 
achieve efficient angular momentum transport 
in a radiative star. 


Global numerical simulations 


We model a radiative stellar layer by consid- 
ering the swirling flow of a stratified, nonideal, 
electrically conducting fluid between two co- 
axial, spherical shells spinning at different 
rates. The intensity of the differential rota- 
tion is controlled by the dimensionless Rossby 
number Ro = AQ/Q, where Q and Q + AQ are 
the angular velocities of the outer and inner 
shell, respectively, and the strength of stratifi- 
cation is quantified by the buoyancy frequency 
N (25). 
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When the differential rotation across the 
radiative zone is weak (compared with overall 
rotation), the flow is stable and axisymmetric 
and all velocity perturbations decay away rap- 
idly. For steeper rotation profiles, the rota- 
tional invariance of the flow is broken by the 
destabilization of a free shear layer. In this case, 
Fig. 1A shows the resulting magnetic energy. 
Initially weak magnetic fields undergo expo- 
nential amplification and saturate to a mag- 
netic state, whose final amplitudes can be tuned 
in our simulations by varying the fluid's ratio 
of molecular to magnetic diffusivities (the mag- 
netic Prandtl number Pm). For large values 
of the plasma magnetic diffusivity (Pm < 0.5), 
the shear instability amplifies a laminar and 
mostly axisymmetric toroidal dynamo which 
saturates at weak magnetic energies [the di- 
mensionless Elsasser number A (25) is always 
less than one]. However, if the toroidal energy 
exceeds a transition value A. ~ 1, a secondary 
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Fig. 3. Angular momentum transport. (A) Total dimensionless torque (25) 
exerted on the inner sphere as a function of the dimensionless quantity 

N = r?!? (UgQ)*/? (Nv? (25). Data points are for simulations that result in non- 
magnetic flows (empty circles), weak-dynamos (green stars), and strong- 


instability is triggered: the exponential growth 
steepens, the system becomes turbulent, and the 
magnetic energy saturates at values nearly two 
orders of magnitude greater. We refer to these 
two dynamo solutions as the weak and strong 
dynamos, by analogy with Earth's dynamo 
(26). Figure 1B shows a bifurcation diagram 
that illustrates the subcritical behavior of the 
strong dynamo branch: once the dynamo has 
been triggered for steep rotation profiles, it 
can be maintained even when the shear rate 
is decreased well below that of the onset of 
hydrodynamic instability. 

We propose that, when stars exhaust their 
fuel, the differential rotation across the ra- 
diative zone is large enough (high Ro) for ex- 
tremely weak initial magnetic fields to be 
amplified by dynamo action. This induces 
enhanced outward transport of angular mo- 
mentum, gradually flattening the rotation pro- 
file (decreasing the effective Ro) as the star 
evolves. The magnetic field then dynamical- 
ly adjusts to the smoother rotation profile 
and sustains the turbulent motions on which 
it feeds, thus maintaining the magnetic field 
(Fig. 1B). 

Figure 2 shows snapshots of our simulations 
before and after the steepening of the expo- 
nential growth to illustrate how the strong 
dynamo causes the subcritical transition to 
hydrodynamic turbulence. Steeper magnetic 
growth occurs at the same time as flow de- 
stabilization; after this time, the magnetic 
field has become strongly chaotic and exhibits 
fluctuations at small scale. The corresponding 
velocity field becomes highly turbulent, espe- 
cially in the inner regions of the star. This tran- 
sition to turbulent flow motions suppresses 


differential rotation across the fluid, causing 
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the rotation to flatten (Fig. 2B) as the strong 
dynamo builds up (movie S1). 


Comparison with the Tayler-Spruit mechanism 


The strong dynamo that appears in our simu- 
lations shares several properties with the TS 
mechanism (27). First, this dynamo feeds on 
the interaction between a large-scale, toroidal 
magnetic field and differential rotation. The 
resulting fields have a dominant axisymmetric, 
toroidal component, containing more than 80% 
of the magnetic energy. Second, the spatial 
structure displayed by the magnetic field has 
a smaller length scale in the radial direction 
than in the azimuthal direction. Third, our 
simulations show that strong dynamos arise 
when the maximum amplitude of the axi- 
symmetric component of the azimuthal mag- 
netic field exceeds the local stability threshold 
of the Tayler instability, in agreement with 
theoretical predictions (27). 

However, the TS dynamo loop in our simu- 
lations is initiated differently from the theo- 
retical prediction (27), which may explain why 
TS dynamos have not appeared in previous 
numerical simulations (supplementary text). In 
theoretical models, the finite amplitude toroidal 
field required to trigger the Tayler instability is 
assumed to have grown (linearly) out of an infin- 
itesimal poloidal field, wound up by differen- 
tial rotation. In our simulations, this initial 
step is instead provided by the weak dynamo 
instability, which (exponentially) amplifies the 
toroidal field and kickstarts a subcritical TS 
dynamo. Previous numerical simulations have 
shown that several mechanisms could per- 
form this role in stellar interiors (supplemen- 
tary text). The dynamo transition we identify 
is turbulent, with many unstable modes ex- 


dynamos [red, purple, orange, and yellow symbols (see legend)]. The arrow 
connects two simulations with identical parameters, with and without magnetic 
fields. (B) Results for the magnetic torque only. The purple line is the theoretical 
prediction for the TS dynamo (21). 


cited simultaneously, which is consistent with 
laboratory analogs of the Tayler instability 
(supplementary text). This indicates that a 
fluctuations-based TS dynamo occurs in our 
simulations, in which the axisymmetric field is 
replenished by the mean electromotive force, 
so magnetic fluctuations influence the satura- 
tion mechanism (27). The turbulent nature of 
the transition resolves an issue with the pre- 
dicted TS dynamo model: that the dynamo loop 
cannot be closed with a single nonaxisymmet- 
ric mode becoming unstable at the onset of 
the Tayler instability, because winding up the 
latter would not (alone) replenish the required 
axisymmetric toroidal field (24) (supplement- 
ary text). 


Angular momentum transport 


We quantify the enhanced transport of angu- 
lar momentum to the outer regions of the star 
by measuring the total (G) and magnetic (Gy) 
torques exerted on the swirling fluid. We per- 
formed a range of simulations (table S1) that 
systematically varied the overall rotation rate 
and the stratification of the radiative layers. 
Figure 3 shows the resulting torques for strong 
dynamo action, which follow a transposed ver- 
sion of the theoretically predicted power law 
Gu ~N= Bre! 2 eee — (21), where7; is the inner 
shell radius, v the plasma viscosity, Up the local 
azimuthal velocity measured in the dynamo 
region, and Bis an adjustable parameter (25). 
Weak-field dynamos do not follow this rela- 
tion (Fig. 3). 


Implications for stellar structure 


Our simulations produce a turbulent radia- 
tive dynamo that shares many features with 
the TS model. A strong magnetic field can be 
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sustained by dynamo action inside a stably 
stratified radiative zone, suppressing differ- 
ential rotation and causing spin-down of the 
stellar core. As such, this dynamo provides a 
plausible mechanism to account for the en- 
hanced transport of chemical elements and 
angular momentum in nonconvective stellar 
layers. In particular, it also provides a potential 
additional transport mechanism for the radia- 
tive layers of solar-type stars (28). Helioseismol- 
ogy has shown that the Sun has a flat rotation 
profile in its radiative zone (29). 

The poloidal component of the dynamo 
mechanism we identify is extremely weak, so 
the resulting magnetic fields are almost en- 
tirely toroidal. They are also embedded deep 
in the star's internal layers, where intense dif- 
ferential rotation takes place. The thick stellar 
outer layers screen the magnetic field from the 
surface, preventing direct observations—but 
might potentially be inferred by using as- 
teroseismology (30). Our results therefore 
provide a physical mechanism for enhanced 
transport of angular momentum in stellar in- 
teriors through a dynamo action that produces 
no surface magnetic field. 
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Changing the climate 


ir, I am a climate scientist. Your remarks are unequivocally false.” I listened in admiration as a 
fellow scientist discredited a climate denier in her audience who had spouted false information 
during a Q&A session. As a climate scientist who had recently moved to Texas to start a faculty 
position, I was nervous about having to field similar questions during talks to the general pub- 
lic. I had given those kinds of talks in the past, but they were mostly to groups in more liberal 
states, where I felt I was preaching to the choir. Texas was a different story. 


I have always viewed science com- 
munication as one of the most im- 
portant and urgent parts of my 
job—and a main reason I wanted to 
become an expert in climate science 
in the first place. As a Ph.D. student 
in California, I leveraged my train- 
ing by preparing climate change 
summaries for a local congressman. 
I coached elementary school teach- 
ers in how to teach students about 
climate. And I worked with young 
women to empower them to work 
on climate solutions. 

It felt good to step outside aca- 
demia and share my expertise on 
issues that were important to me. 
But at the same time, I sometimes 
wondered: Am I really making a 
difference? When I spoke, I was 
almost always met with knowing 
nods and immediate moves to action. In Rhode Island, too, 
where! did my postdoc, my audiences were already on board. 

Then, I moved to Texas. My partner had landed a job there, 
and I was able to secure a postdoc nearby followed by a fac- 
ulty position. I was excited at the prospect of engaging in 
outreach with people who weren't yet concerned about the 
climate. But I wasn’t sure what to expect—or how I'd deal 
with audiences that were skeptical of what I had to say. 

I started by doing what I’d always done in the past— 
calling local organizations and offering to work with them. 
But this time, it wasn’t always smooth sailing. I spoke on pan- 
els where I was insulted, threatened by elected officials, and 
talked down to. Unaccustomed to those kinds of attacks, I 
became flustered and angry, and sometimes it showed. 

Those experiences were stressful and emotionally taxing. 
But I had to learn to deal with them if I wanted to make an 
impact. Luckily, I found that the state’s network of climate 
scientists was close-knit. I immediately had colleagues from 
other universities introducing themselves and inviting me to 
give talks. They had advice about how to deal with hostility 
and misinformation. “Don’t get visibly frustrated,’ one told 


“L explained the state of the 
science and gently but directly 
corrected falsehoods.” 


me. “Stay as professional as possible. 
Be polite but be firm in correcting 
inaccurate statements,” another said. 

During a panel discussion a few 
months later, I was able to call on 
their advice. When a fellow panel- 
ist made erroneous statements, I 
explained the state of the science 
and gently but directly corrected 
falsehoods. Zoom worked to my ad- 
vantage: I placed figures in the chat 
to fact-check misstatements. When 
I sensed the conversation moving 
in a bad direction, I didn’t become 
confrontational—I_ simply spoke 
when it was my turn and went back 
to the facts. Although I might not 
have changed the other panelist’s 
mind, I hope I got through to at 
least some audience members. 

As time wore on, I became more 
comfortable dealing with those kinds of situations. And I 
started to truly feel that I was making a difference. During a 
Girl Scout event, parents came up to me to say they’d never 
learned about climate science before, with one remarking, “I 
know you are here to teach the girls, but it was really valu- 
able for the parents, too.” A nonprofit organization founded 
by Republican women also reached out to me, and I’ve been 
informally advising one of their members—a former state 
representative—who is preparing to speak with Republican 
voters about climate change. 

Speaking to audiences in California and Rhode Island may 
have been more comfortable. But I am glad that I now have 
the opportunity to do this work where it really matters—in 
places where I am not preaching to the choir. I finally have an 
opportunity to help mend a harmful divide. We all need clean 
air and a stable climate. And so, I encourage my fellow sci- 
entists who are interested in public outreach to run toward, 
rather than away from, these kinds of opportunities. Talk to 
everyone you can find. People are listening. 


Sylvia Dee is an assistant professor at Rice University. 
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